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BACILLUS STEAROTHERMOPHILUS TAU, DELTA, AND DELTA PRIME 
POLYMERASE SUBUNITS AND USE THEREOF 

The present application is a continuation of U.S. Patent Application 
5 Serial No. 09/716,964, filed November 21, 2000, which is a continuation-in-part of 

U.S. Patent Application Serial No. 09/642,218, filed August 18, 2000, as a 
continuation of U.S. Patent Application Serial No. 09/057,416 filed April 8, 1998, 
which claims the benefit of U.S. Provisional Patent Application Serial No. 60/043,202 
filed April 8, 1997, all of which are hereby incorporated by reference in their entirety. 
10 The present invention was made with funding fi-om National Institutes 

of Health Grant No. GM38839. The United States Government may have certain 
rights in this invention. 

FIELD OF THE INVENTION 

15 The present invention relates to thermostable DNA polymerases and, 

more particularly, to such polymerases as can serve as chromosomal replicases and 
are derived fi"om thermophilic bacteria. More particularly, the invention extends to 
DNA polymerase Ill-type enzymes fi"om thermophilic bacteria, including Aquifex 
aeolicus, Thermus thermophilus, Thermotoga maritima, and Bacillus 

20 stearothermophilus, as well as purified, recombinant or non-recombinant subunits 

thereof and their use, and to isolated DNA coding for such polymerases and their 
subunits. Such DNA is obtained fi*om the respective genes (e.g., dnaX, holA, holB, 
dnaA, dnaN, dnaQ, dnaE^ ssb, etc.) of various thermophilic eubacteria, including but 
not limited to Thermus thermophilus, Aquifex aeolicus, Thermotoga maritima, and 

25 Bacillus stearothermophilus. 



BACKGROUND OF THE INVENTION 
Thermostable DNA polymerases have been disclosed previously as set 
forth in U.S. Patent No. 5,192,674 to Oshima et al., U.S. Patent Nos. 5,322,785 and 
30 5,352,778 to Comb et al., U.S. Patent No. 5,545,552 to Mathur, and others. All of the 

noted references recite the use of polymerases as important catalytic tools in the 
practice of molecular cloning techniques such as polymerase chain reaction (PCR). 
Each of the references states that a drawback of the extant polymerases are their 



limited thennostability, and consequent useful life in the participation in PCR. Such 
limitations also manifest themselves in the inability to obtain extended lengths of 
nucleotides, and in the instance of Tag polymerase, the lack of 3* to 5' «cohuclease 
activity, and the drawback of the inability to excise misinserted nucleotides (Porino, 
1990). ^ 

More generally, such polymerases, including those disclosed in the 
referenced patents, are of the Polymerase I variety as they are often 90-95kpa in size 
and may have 5' to 3' exonuclease activity; They defme a single subunit with 
concomitant limits on their ability to hasten Ae amplification process and to promote 
the rapid preparation of longer strands of DNA. 

Chromosomal replicases are composed of several subimits in all 
organisms (Komberg and Baker, 1992). In keeping with the need to replicate long 
chromosomes, replicases are rapid and highly processive multiprotein machines. 
Cellular replicases are classically comprised of three componaits: a clamp, a clamp 
loader, and tiie DNA polymerase (reviewed in Kehnan and O'Donnell, 1 995; 
McHeniy, 1991). For purposes of the present invention, the foregoing comiponents 
also serve as a broad definition of a "PolIII-type enzyme". 

DNA polymerase III holoenzyme (Pol III holoenzyme) is the 
multi-subunit replicase of the £. coU chromosome^ Pol III holoenzyme is 
distinguished from Pol I type DNA polymerases by its high processivitjr (>50 kbp) 
and rapid rate of synthesis (750 nts/s) (reviewed in Komberg and Baker, 1992; 
Kelman and O'Donnell, 1 995). The high processivity and speed is rooted in a ring 
shaped subunit, called B, that encircles DNA and slides along it while tethering the 
Pol III holoenzyme to the template (Stukenberg et al., 1991; Kong et al., 1992). The 
ring sh^ed P clamp is assembled aroimd DNA by the multisubvmit clamp loader, 
called y complex. The y complex couples the eno-gy of ATP hydrolysis to the 
assembly of the B clamp onto DNA. This y complex, which fimctions as a clamp 
loader, is an integral component of the Pol III holoenzyme particle. A brief overview 
of the organization of subunits within the holoenzyme and their function follows: . 

Pol III holoenzyme consists of 10 different submits, some of which are 
present in multiple copies for a total of 1 8 polypeptide chains (Onrust et al., 1 995). 
The organization of these subunits in the holoenzyme particle is illustrated in Fig. 1. 
As depicted in the diagram, the subunits of the holoenzyme can be grouped 
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functionally into three components: 1) the DNA polymerase III core is the catalytic 
unit and consists of the a (DNA polymerase), e (3'-5' exonuclease), and e subunits 
(McHenry and Crow, 1979), 2) the B "sliding clamp" is the ring shaped protein that 
secures the core polymerase to DNA for processivity (Kong et al., 1992), and 3) the 5 
5 protein 7 Complex (y56'xv|/) is the "clamp loader" that couples ATP hydrolysis to 

assembly of B clamps around DNA (O'Donnell, 1 987; Maki et al.^ 1988). A dimer of 
the t subunit acts as a "macromole«?ular organizer" holding together two molecules of 
core (Studwell-Vaughan and G'Donnell, 1991; Low et al., 1976) and one molecule of 
Y complex forming the Pol III* subassembly (Onrust et al., 1995). this organizing 
10 role of T to form Pol III* is indicated in the center of Fig. 1 . Two B dimers associate 
with the two cores within Pol III* to form the holoenzyme, which is capable of 
repUcating both strands of duplex DNA simultaneously (Maki et al., 1988). 

The DNA polymerase III holoenzyme assembles onto a primed 
template in two distinct steps. In the fitst step, the y complex assembles the B clamp 
15 ontotheDNA. The y complex and the core polymerase utilize the same suifece of the 
B ring aiid they cannot both utilize it at the same time (Naktinis et al., 1996). Hence, 
in the second step the y complex moves away from B thus allo>ying access of the core 
polymerase to the B clamp for processive DNA synthesis. The y complex and core 
remain attached to each other during this switching process by the T subunit organizer. 
20 They complexconsistsof 5 different subunits (y2.48i5'iXiVi)- An 

overview of the mechanism of the clamp loading process follows. The 5 subunit is 
the major touch point to the B clamp arid leads to ring opening, but 5 is buried within 
y complM such that contact with B is prevented (Naktinis et al., 1995). The y subunit 
is the ATP interactive protein but is not an ATPase by itself (Tsuchihashi and 
25 Komberg, 1 989). The 5' subunit bridges the 5 and y subunits resulting in a y55' 

complex that exhibits DNA dependent ATPase activity and is competent to assemble 
clamps on DNA (Onrust et al., 1991). Upon binding of ATP to y, a change in the 
conformation of the complex exposes 5 for interaction with B (Naktinis et al., 1 995). 
The fimction of the smaller subunits, x and m;, is to contact SSB (through x) thus 
30 promoting clamp assembly and high processivity during replication (Kelman and 
O'Donnell, 1995). 
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The three component Pol Ill-type enzyme in eukaiy otes contains a 
clamp that has the same shape as E: coli B, but instead of a homodimer it is a 
heterotrimer. This heterotrimeric ring, called PCNA (proliferating cell nuclear 
antigen), has 6 demons like B, but instead of each PCNA monomer being composed 
5 of 3 domains and dimerizing to form a 6 domain ring (e.g., like B), the PCNA 
monomer has 2 domains and it trimerizes to form a 6 domain ring (Krishna et al., 
1994; Kuriyan arid O'Donnell, 1 993). The chain fold of the domains are the same in 
prokaryotes (B) and eiikaryotes (PCNA); thus, the rings have the same overall 

6-domain ring shape. The clamp loader of the eukaryotic Pol Ill-type replicase is 
10 caUed RFC (Replication factor C) and it consists of subunits having homology to the 
Y and 5' subunits of the E. coli y complex (CuUmann et al., 1995). The eukaryotic 
DNA polymerase Ill-type enzyme contains either of two DNA polymerases, DNA 
polymerase 8 and DNA polymerase e (Bambara and Jessee, 1 991 ; Linn, 1991; 
Sugino, 1995). It is entirely conceivable that yet other types of DNA polymerases can 

1 5 function with either a PCNA or B clamp to form a Pol Ill-type enzyme (for example, 
DNA polymerase II of E. coli functions with the B subunit placed onto DNA by the y 
complex clamp loader) (Hughes et al., 1991; Bonner et al., 1992). The bacteriophage 
T4 also utilizes a Pol Ill-type 3-component replicase. The clamp is a homotrimer like 
PCNA, called gene 45 protein (Young et al., 1 992). The gene 45 protein forms the 

20 same 6-domain ring structure as Band PCNA (Moarefietal., 2000). The clamp 

loader is a complex of two subunits called the gene 44/62 protein complex. The DNA 
polymerase is the gene 43 protein and it is stimulated by the gene 45 sliding clamp 
when it is assembled onto DNA by the 44/62 protein clamp loader. The Pol Ill-type 
enzyme may be either bound together into one particle (e.g.. E. coli Pol III 

25 holoenzyme), or its three components may function separately (like the eukaryotic Pol 

Ill-type replicases). 

There is an early report on separation of three DNA polymerases from 
T. th. cells, however each polymerase form was reminiscent of the preexistmg types of 
DNA polymerase isolated from thermophiles in that each polymerase was in the 
.30 1 10,000-120,000 range and lacked 3'-5' exonuclease activity (Ruttimann et al., 1 985). 
These are well below the molecular weight of Pol Ill-type complexes that contain in 
addition to the DNA polymerase subunit, other subunits such as y and x. Although the 
three polymerases displayed some differences in activity (column elution behavior. 



and optimum divalent cation, template, and temperatuifes) it seems likely lliiat these 
three forms were either different repair type polymerases or derivatives of one repair 
enzyme (e.g., Pol I) that was modified by post translational modification(s) that 
altered their properties (e.g. phosphorylation, methylation, proteolytic clipping of 
residues that alter activity, or association with different ligands such as a small protein 
or contaminating DNA). Despite this previous work, it remained to be demonstrated 
that thennophiles harbor a Pol Ill-type enzyme that contain multiple subunits such as 
Y arid/or t, functioned with a sliding clamp accessory protein, or could extend a 
primer rapidly and pfocessively over a long stretch (>5kb) of ssDNA (Ruttimann et 
al., 1985). 

Previously, it was not known what polymerase thenriophilic bacteria 
used to replicate their chromosome since only Pol I type enzymes have been reported 
from thermophiles. By distinction, chromosomal replicases, such as Polymerase HI, 
identified in E: coli, if available in a thermostable bacterium, with all its accessory 
subunits, could provide a great improvement over the Polymerase I type enzymes, in 
that they are generally much more efficient - about 5 times faster - and much more 
highly processive. Hence, one may expect faster and longer chain production in PCR, 
and higher quality of DNA sequencing ladders. Clearly, the ability to practice such 
synthetic techniques as PCR would be enhanced by these methods disclosed for how 
to obtain genes and subunits of DNA polymerase III holoenzyme fi-om thermophilic 
sources. . 

The present invention is directed to achieving these objectives and 
overcoming the various deficiencies in the art. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, DNA Polyiherasp Ill-type 
enzymes as defined herein are disclosed that may be isolated and purified from a 
thermophilic bacterial source, that displiay rapid synthesis characteristic of a 
chromosomal replicase, and that possesses all of the structural and processive 
advantages sought and recited above. More particularly, the invention extends to 
thermostable Polymerase Ill-type enzymes derived from thermophilic bacteria that 
exhibit the ability to extend a primer over a long stretch (>5kb) of ssDNA at elevated 



temperature, the ability to be stimulated by a cogaate sliding clamp (e.g., p) of the 
type that is assembled on DNA by a 'clamp' loader (e.g., y complex), and have clanap 
loadmg subunits that show DNA stimulated ATPase activity at elevated temperature 
and/or ionic strength. Representative thermophile polymerases include those isolated 
from the thermophilic eubacteria Aquifex aeolicus {A.ae. polymerase) and other 
members of the Aquifex genus; Thermus thermophilus (T. th. polymerase), Thermus 
favus {Jfi/Tub polymerase), Thermus ruber {Tru polymerase), Thermus brockianus 
(DYNAZYME™ polymerase), and other members of the Thermus genus; Bacillus 
stearothermpphilus {B.st. polymerase) and other members of the Bacillus genus; 
77»€rmoi?/ar»ia acirfoj7/ji7«/w (Tac polymerase) and other members of the 
7%ermoj7/ajOTa genus; aind 7%ermprpga /iea/7o//to«fl (7>ie polymerase; see WO 
96/1 0640 to Chatterjee et al.), Thermotoga maritima {Tma polymerase; seis U.S. 
Patent No. 5,374,553 to Gelfand et al.), and other species of the Thermotoga genus 
(75/7 polymerase), hi a preferred embodiment, the thermophilic bacteria comprise 
species of Aquifex, Thermus, Bacillus, and Thermotoga, and particularly ^.ac, T.th., 
B.st., and Tma. 

A particular Polymerase Ill-type enzyme in accordance with the 
invention may include at least one of the following sub-units: 

A. a y subunit having an amino acid sequence corresponding to 
SEQ. ID. Nos. 4 or 5 (7-./^.); 

B. ax subxmit having an amino acid sequence corresponding to 
SEQ. ID. No. 2 iT.th.), SEQ. ID. No. 120 (A.ae.), SEQ. ID. No. 142 {Tma.) or SEQ. 
ID. No. 182 (B.k); 

C. a 8 subvmit having an amino acid sequence corresponding to 
SEQ. ID. No. 95 (T.th.), SEQ. It). No. 128 (A.ae:), or SEQ. ID. No. 140 (T.ma.); 

. D. a a subunit. including an amino acid sequence corresponding to 
SEQ. ID. No. 87 (T.th.), SEQ. ID. No! 1 18 (A.ae.), SEQ. ID. No. 138 (T.ma.), or 
SEQ. ID. Nos. 1 84 (PolC which has both a and e activity, B.st.); 

E. a fi subunit having an amino acid sequence corresponding to 
SEQ. ID. No. \07 (T.th.), SEQ. ID. No. 122 (A.ae.), SEQ. ID. No. 144 (T.ma.), or 
SEQ. ID. No. 174(5. J/.); 



F. a 5 subimit having an amino acid sequence corresponding to 
SEQ. ID. No. 158 IT.th.), SEQ. ID. No. 124 (A.ae.), SEQ. ID. No. 146 {T.ma.) or 
SEQ.ip.No. 178(S.5f.); 

G. a 6' subvinit having an amino acid sequence corresponding to 
SEQ. ID. No. 156 (T.th.), SEQ. ID. No. 126 .(^.ae.), SEQ. ID. No. 148 (r.wo.) or 
SEQ. ID. N0. 180(5.^0; 

variants, including allelic vari^ts, muteiiis, analogs and fragments of 
any of subparts (A) through (G), and compatible combiniations thereof, capable of. 
functioning in DNA amplification and sequencing. 

The invention also extends to the genes that correspond to and can 
code on expression for the subunits set forth above, and accordingly includes the 
following: dnaX, hoLA, holB, dnaQ, dnaE, dmN, and ssby as well as conserved 
variants and active fi-agments thereof. 

Accordingly, the Polymerase Ill-type enzyme of the present invention 
comprises at least one gene encoding a subvmit thereof, which gene is selected from 
the groiq) consisting of dnaX, hqlA, holB, dnaQ, dnaE and dnaN, and combinations 
thereof. More particularly, the invention extends to the nucleic acid molecule 
encoding the y aiid x subunits, and includes the dnaX gene which has a nucleotide 
sequence as set forth hsxioy as well as conserved variants, active fragments and 
analogs thereof. Likewise, the nucleotide sequences encoding the a subimit {dnaE 
gene), Ae 8 subunit (i/na^gene), the p subunit (rfwoA^ gene), the 6 subunit (/loW 
gene), and the 6* subimit {holB gene) each comprise the nucleotide sequences as set 
forth herein, as well as conserved variants, active fragments and analogs thereof. 
Those nucleotide sequences for T.th. are as follows: dnaXiSEQ. ID. No. 3)* dnaE 
(SEQ. ID. No. 86), dnaQiS^Q. ID. No. 94), dnaNiSBQ. ID. No. 106),7ioM (SEQ. 
ID. No. 157), and holB (SEQ. ID. No. 1 55). Those nucleotide sequences for ^.ae.are 
as follows: dnaX{SW^.\T>. No. 1 19), dnaE (SEQ. ID. No. 1 17), dnaQ (SEQ. ID. No. 
127), d«aA^(SEQ. ID. No. 121), holA (SEQ. ID. No. 123), and holB (SEQ. ID. No. 
125). Those nucleotide sequences for T.ma. are as follows: dnaX{SEQ. ID. No. 141), 
dnaEiSEQ' ID. No. \3T),dnaQ (SEQ. ID. No. 139), i/woAT (SEQ. ID. No. 143), holA 
(SEQ. ID. No. 145), and holB (SEQ. ID. No. 147). Those nucleotide sequences for 
B.St, are as follows: dnaX(SBQ. ID. No. \%\\polC (SEQ. ID. Nos. 1 83), dnaN 
(SEQ. ID. No. 173), holA (SEQ. ID. No. 177), and holB (SEQ. ID. No. 179). 
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. . The invention also provides methods and products for identifying, 
isolating and cloning DNA molecules which encode such accessory subunits encoded 
by the recited genes of the DNA polymerase Ill-type enzyme hereof. 

Yet further, the invention extends to Polymerase Ill-type enzymes 
5 prepared by tiie purification of an extract taken from, e.g., tiie particular tfaermophile 
vinder examination, treated with appropriate solvents and then subjected to 
chromatographic separation on, e.g:, an anion exchange column, followed by analysis 
of long chain syntiietic ability or Western analysis of the respective peaks against 
antibody to at least one of the anticipated enzyme subunits to confirm presence of Pol 
10 III, and thereafter, peptide sequencing of subunits that co purify and amplification to 
. obtain the putative geiie and its encoded enzyme. 

The present invention also relates to recombinant y, x, e, a (as well as 
PolC), 6, 5' and B subunits and SSB from thermophiles. In the instance of the y and x 
subuiiits of T. th., thie invention includes the characterization of a frameshifting 
15 sequence that is internal to the gene and specifies relative abundance of the y and x 
gene products of T th. dnaX. From this characterization, expression of either one of 
the subunits can be increased at the expense of the other (i.e. mutant fraiheshift could 
make all x, simple reclpning at the end of the frameshift could make exclusively ir and 
nbx). 

20 In a further aspect of the present invention, DNA probes can be 

constructed from the DNA sequences coding for, e.g., the T.th.,A.ae., T.ma., ox B.st. 
dnaX, dnaQ, dnaE, dnaA, dnaN, holA, holB, and ssb genes, conserved variants and 
active fragments thereof, all as defined herein, and may be used to identify and isolate 
the corresponding genes coding for the subunits of DNA polymerase III holoenzyme 
25 from other thermophiles, such as those listed earlier herein. Accordingly j all 

chromosomal replicases (DNA Polymerase lll-type) firom thermophilic sources are 

contemplated and included herein. 

The invention also extends to methods for identifying Polymerase Ill- 
type enzymes by use of the techniques of long-chain extension and elucidiation of 
30 subunits with antibodies, as described herein and with reference to tiie examples. 

The invention further extends to the isolated and purified DNA 

Polymerase III from T.th.,A.ae., T.ma., and B.st., the amino acid sequences of the y, x, 
e, a (as well as PolC), 8, 5', and 13 subunits and SSB, as set forth herein, and the 



nucleotide sequences of the corresponding genes from T.th.,A.ae,, T.ma, or B.st, set 
forth herein, as well as to active fragments thereof, oligonucleotides and probes 
prepared or derived therefrom and the transformed cells that may be likewise 
prepared. Accordingly, the invention comprises the individual subunits enumerated 
above andhereinafter, corresponding isolated polynucleotides and respective amino 
acid sequences for each of the y, x, 8, a (as well as PolC), 6, 5', and B subunits and 
SSB, and to conserved variants, fragments, and the like, as vvell as to methods of their 
preparation and use in DNA amplification and sequencing, hi a particular 
embodiment, the mvention extends to vectors for the? expression of the subunit genes 
of the present invention. 

The invention also includes methods for the preparation of the DNA 
Polymerase Ill-type enzymes and the corresponding subimit genes of the present 
invention, and to the use of the enzymes and constructs having active fragments 
thereof, in the preparation, recohstitution or modification of like enzymes, as well as 
in amplification and sequencing of DNA by methods such as PGR, arid like protocols, 
and to the DNA molecules amplified and sequenced by such riiethods. In this regard, 
a Pol Ill-type enzyme that is reconstituted in the absence of 8, or using a mutated e 
with less 3'-5' exoriuclease activity, may be a superior enzyme in either PGR or DNA 
sequencing applications, (e.g. Tabor et al., 1995). 

The invention is directed to methods for amplifying and sequencing a 
DNA molecule, particularly via the polymerase chain reaction (PGR), using the 
present DNA polymerase Ill-type enzymes or complexes. In particular, the invention 
extends to methods of amplifying and sequencing of DNA using theimostable pel Ill- 
type enzyme complexes isolated from thermophilic bacteria such as Thermotoga and 
Thermus species, or recombinant thermostable enzymes. The invention also provides 
amplified DNA molecules made by the methods of the invention, and kits for 
amplifying or sequencing a DNA molecule by the methods of the invention. 

In this connection, the invention extends to methods for amplification 
of DNA that can achieve long chain extension of primed DNA, as by the application 
and use of Polymerase Ill-type eiizymes of the present inventipn. An illustration of 
such inethpds is presented in Examples 1 5 and 16, infra. 

Likewise, kits for amplification and sequencing of such DNA 
molecules are included, which kits contain the enzymes of the present invention. 
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including subunits thereof, together with other necessary or desirable reagents and 
materials, and directions for use. The details of the practice of the invention as set 
forth above and later on herein, and with reference to the patents and literature cited 
herein, are all expressly incorporated herein by reference and made a part hereof. 

As stated, and in accordance with a principal obj ect of the present 
invention. Polymerase Ill-type enzymes and their sub-units are provided that are 
derived from themiophiles and that are adapted to participate in improved DNA 
amplification and sequencing techniques, and the consequent ability to prepare larger 
DNA strands more rapidly and accurately. 

It is a further object of the present invention to provide DNA 
molecules that are amplified and sequenced using the Polymerase Ill-type enzymes 
hereof. 

It is a still further object of the present invention to provide enzymes 
and corresponding methods for amplification and sequencing of DNA that can be 
practiced without the participation of the clamp-loading component of the enzyme. 

It is a still further object of the present invention to provide kits and 
other assemblies of materials for the practice of the methods of amplification and 
sequencing as aforesaid, that include and use the DNA polymerase Ill-type enzynries 
herein as part thereof . 

One goal of this invention is to fully reconstitute the rapid and 
processive replicase from an extreme thermophilic eubacterium from fully 
recombinant protein subunits. One might think that the extreme heat in which these 
bacteria grow may have resulted in a completely different solution to the problem of 
chromosome replication. Prior to filing of the previously-identified priority 
applications, it is believed that Pol III had not been identified in any thermophile until 
the present inventors found that Thermus thermophilus, which grows at a rather high 
temperature of 70-80°C, would appear to contain a Pol III. Subsequent to this 
invention, the genome sequence of A. aeolicus was published which shows dnaE^ 
dndN^ and JwoA" genes. However, previous work did not fully reconstitute the 
working replication machinery from fiilly recombinant subunits. A holA gene and 
holB has not been identified previously in T. thermophilus or A. aeolicus^ and studies 
in the £. coli system show that delta and delta prime, encoded by holA and holB, 
. respectively, are essential to loading the beta clamp onto DNA and, thus, is essential 



for rapid and processiye holoenzyme function (U.S. Patent Nos. 5,583,026 and 
5,668,004 to O'Donnell, which are hereby incorporated by referenc^^ 

Tliis invention fully reconstitutes a ftuictional DNA polymerase 
holoenzyme from the extreme thermophiles Thermus thermophilus and Aquifex 
aeolicus. Aquifex aeolicus grows at an even higher temperature than Thermus 
thermophilus, up to 85°C. In this invention, the genes of Thermus thermophilus, 
Aquifex aeolicus, Thermotoga maritima, and Bacillus stearothermophilus that are 
necessary to reconstitute the complete DNA polymerase III machinery, which acts as 
a rapid and processive polymerase, are identified. Indeed, a delta prime QiolB) and 
delta (/loM) subunits are needed. 

i\i(^dnaE,drtaN,dnaX,driaQ,h6U,zndiholB%en^ 
express and purify the protein "gears", arid the proteins are used to reassemble the 
replication machme. The T th. Pol III is similar to E. coli. The Amc. Pol III is slightly 
dissimilar from the machinery of previously studied replicases. Thie A.ae. rfwoA^gene 
encoded only one protein, tau, and in this fashion is similar to the dnaXofihsi gram 
positive organism. Staphylococcus aureus. In contrast, the dnaX of Uti^ gram negative 
cell, E. coli, produces two proteins. The Aquifex aeolicus polymerase subunit, alpha 
(encoded by driaE) does not contain the 3-5' proofreading exonuclease. In this 
regard, A. aeolicus is similar to E, coli, but dissimilar to the replicase of the gram 
positive organisms. In Gram positive organisms, the PolC polymerase subimit of the 
replicase contains the exonuclease activity in the same polypeptide chain as the 
polymerase (Low et al„ 1976; Barnes et al., 1994; Pacitti et al., 1995). Further, the 
polymerase III of thermophilic bacteria retains activity at high temperature. 

Thermostable rapid and processive three component DNA polymerases 
can be applied to several important uses. DNA polymerases cxirrently in use for DNA 
sequencing and DNA amplification use enzymes that are much slower and thus could 
be improved upon. This is especially true of amplification as the three component 
polymerase is capable of speed and high processivity making possible amplification 
of very long (tens of Kb to Mb) lengths of DNA in a time-efficient manner. These 
three component polymerases also function in conjunction with a replicative helicase 
(DnaB), and thus are capable of amplification at a single temperature, using the 
helicase to melt the DNA duplex. This property could be usefiil in some methods of 
amplification, and in polymerase chain reaction (PGR) methodology. For example, 
the axSSVp form of the E. coli DNA polymerase III holoenzyme has been shown to 
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function in both DNA sequencing and PGR (U.S. Patent Nos. 5,583,026 and 
5,668,004 to O'Donnell). 

Other objects and advantages will become apparent from a review of 
the ensuing description which proceeds with reference to the following illustrative 
drawings. 

DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is a schematic depiction of the structure and components of 
enzymes of the general family to which the enzymes of the present invention belong. 

FIGURE 2 is an alignment of the N-terminal regions of E. coli (SEQ. 
ID. No, 19) and A subtilis (SEQ. ID. No. 20) dnaX gene product. Asterisks indicate 
identities. The ATP binding consensus sequence is indicated. The two regions used 
for PGR primer design are shown in bold. 

FIGURE 3 is an image showing the Southern analysis of 7! 
thermophilus genomic DNA. Genomic DNA was analyzed for presence of the dnaZ 
gene using the PGR radiolabeled probe. Enzymes used for digestion are shown above 
each lane. The numbering to the right corresponds to the length of DNA fragments 
(kb). 

FIGURES 4 A and 4B depict the full sequence of the dnaX gene of T 
thermophilus. DNA sequence (upper case, and corresponding to SEQ ID No. 1) aiid 
predicted amino acid sequence (lower case, and corresponding to SEQ ID No. 2) 
yields a 529 amino acid protein (x) of 58.0 kDa, A putative frameshifting sequence 
containing several A residues 1478-1486 (underlined) may produce a smaller protein 
(y) of 49.8 kDa. The potential Shine-Dalgamo (S.D.) signal is bold and underlined. 
The start codon is in bold, and the stop codon for x is marked by an asterisk. The 
potential stop codon for y is shown in bold after the framesWft site, and two potential 
Shine-Dalgamo sequences upstream of the frameshift site are indicated. Sequences of 
the primers used for PGR are shown in italics above the nucleotide sequence of dnaX. 
The ATP binding site is indicated, and the asterisks above the four Gys residues near 
the ATP site indicate the putative Zr?^ finger. The proline rich area is indicated 
above the sequence. Numbering of the nucleotide sequence is presented to the right. 
Numbering of the amino acid sequence of t is shown in parenthesis to the right. 



-13- 

FIGURE 4C depicts the isolated DN A coding sequence fo^ 
gene (also present in FIGURES 3 A and 3B) in accordance with the invention, which 
corresponds to SEQ. ID. No. 3. 

FIGURE 4D depicts the polypeptide sequence of the y subunit of the 
5 Polymerase III of the present invention, which corresponds to SEQ. ID. No. 4. 

FIGURE 4E depicts the polypeptide sequence of the y subunit of the 

Polymerase III of the present invention defined by a -1 frameshift, which corresponds 

to SEQ. ID. No. 4. 

FIGURE 4F depicts the polypeptide sequence of the y subvmit of the 

10 Polymerase III of the present invention defined by a -2 frameshift, which corresponds 
to SEQ. ID. No. 5. 

FIGURES 5A-B are alignments of the y/x ATP binding domains for 
different bacteria. Dots indicate those residues that are identical to the E. coli dnaX 
sequence. The ATP consensus site is imderlined, and the conserved cysteine residues 

15 that form the zinc finger are indicated with asterisks. E. coli, Escherichia coli (SEQ. 
ID, No. 21); H. inf.^ Haemophilus influenzae (SEQ. ID. No, 22); J5. sub.. Bacillus 
subtilis (SEQ. ID. No. 23); C. ores,, Caulobacter crescentus (SEQ. ID. No. 24); M 
gen,. Mycoplasma genitalium (SEQ. ID. No- 25); T./Zi., Thermus thermophilus {SEQ. 
ID. No. 26). Aligmnents were produced using Clustal. 

20 FIGURE 6 is a diagram indicating a signal for ribosomal frameshifting 

in Tjk dnaX, The diagram shows part of the sequence of the RNA (SEQ. ID. No. 27) 
around the frameshifting site (SEQ. ID. No. 28), including the suspected slippery 
sequence A9 (bold italic). The stop codon in the -2 reading frame is indicated. Also 
indicated are potential step loop structures and the nearest stop codons in the -1 

25 reading frame. 

FIGURE 7 is an image showing a Western analysis of y and x in TJh 
cells. Whole cells were lysed in SDS and electrophoresed on a 1 0 % SDS 
pplyacrylamide gel then transferred to a membrane and probed with polyclonal 
antibody against E. coli y/x as described in Experimental Procedures. Positions of 
30 molecidar weight size markers are shown to the left. Putative Tth: y and x are 
indicated to the right. 

FIGURES 8A-B are images of E, coli colonies expressing Tth. dnaX 
-1 and -2 fiBmeshifts. The region of the dnoX gene slippery sequence was cloned into 
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the /acZ gene of pUC 19 in three reading frames, then transformed into E. coli cells 
and plated on LB plates containing X-gal. The slippery sequence was also mutated by 
inserting two G residues into the A9 sequence and then cloned into pUC19 in all three 
reading frames. Color of colonies observed are indicated by the plus signs. The 
picture shows the colonies, the type of frameshift required for readthrough (blue 
color) is indicted next to the sector. 

FIGURE 9 shows the construction of the fjh ylx expression vector. A 
genomic fragment containing a partial sequence of dnaXv/as cloned into pALTER-1 . 
This fragment was subcloned into pUC19 (p\JC19jinaX). Then the N-terminal 
section of rfwoA' was amplified such that the fragment was flanked by Ndel (at the 
initiating codon) and the intemal BamHI site. This fragment was inserted to form the 
entire coding sequence of the dnaX gene in pUC19 (pUC\9dnaX). The dnaXgenc 
was then cloned behind the polyhistidine leader in the T7 based expression vector 
pET16 to give pETl 6dnaX, Details are in "Experimental Procedures". 

FIGURES l OA-C illxistrate the purification of recombinant TJh, y and 
T subunits. TJfu y and t subunits were expressed in E. coli harboring pETlSdnaX. 
Molecular size markers are shown to the left of the gels, and the two induced proteins 
are labeled as g and t to the right of the gel. Panel A) 10% SDS gel of E. coli whole 
cell lysates before and after induction with IPTG. Panel B) 8% SDS gel of the 
purification two steps after cell lysis. First lane: the lysate was applied to a HiTrap 
Nickel chromatography column. Second lane: the TJhJ ylx subunits were further 
purified on a Superose 12 gel filtration column. Third lane, the E, coli y and t 
subunits. Panel C) Western analysis of the pure Tth, y and x subunits (first lane) and 
E. coli y and x subunits (second lane). 

FIGURES 1 1 A-B show the gel filtration of TJh, y and x. Tth, y and x 
were gel filtered on a Superose 12 column. Colimin flections were analyzed for 
ATPase activity and in a Coomassie iBlue stained 10% SDS polyacrylamide gel. 
Positions of moleciilar weight markers are shown to the left of the gel. The elution 
position of size standards analyzed in a parallel Superose 12 column under identical 
conditions are indicated above the gel. Thyroglobin (670 kDa), bovine gamma globin 
(150 kDa), chicken ovalbumin (44 kDa), equine myoglobin (17 kDa). 

FIGURES 12A-C illustrate the characterization of the T.th y and x 
ATPase activity. The T.th, y/x and E, coli x subunits are compared in their ATPase 



- 15- 

activity characteristics. Due to the greater activity of E. coli x, the values are plotted as 
percent for ease of comparison. Actual specific activities for 100 % values are given 
below as pmol ATP hydrolyzed/30 min./pmol T.th. y/x (or pmol E. coli x). Panel A) 
T,th. y and x ATP£ise is stimulated by the presence of ssDNA. T.th. yh was incubated at 
5 65°C. Specific activity was: 1 1 .5 (+DNA); 2.5 (-DNA); E, coli x was assayed at 37°C. 
Specific activity values were: 11 2.5 (H-DNA); (7.3-DNA). Panel B) Temperature 
stability of DNA stimulated ATPase activity. T.th, y/x, 1 1 .3 (65°C); E. coli x, 97.5 
(37^C). Panel C) Stability of T.th. y/x ATPase to NaCl. T.th. y/x, 8.1 (100 mM added 
NaCl and 65°C); E. coli x, 52.7 (0 M added NaCl and 37^C). 

1 0 FIGURES 13A-13C are graphs that summarize the purification of the 

DNA polymerase III firom T.th. extracts. Panel A) shows the activity and total protein in 
column fi-actions fi-om the Heparin Agarose column. Peak 1 fi-actions were 
chromatographed on ATP agarose. Panel B) depicts the ATP-agarose column step, and 
Panel C) shows the total protein and DNA polymerase activity eluted from the MonoQ 

1 5 column. 

FIGURES 14A-B are SDS polyacrylamide gels of Tjh. subunits. 
Fig. 14A is a 12% SDS polyacrylamide gel stained with Coomassie Blue of the MonoQ 
column. Load stands for the material loaded onto the column (ATP agarose bound 
fi*actions). FT stands for protein that flowed through the MonoQ column. Fractions are 

20 indicated above the gel. Tth. subunits in fi-actions 17-19 are indicated by the labels 
placed between fractions 18 and 19. Additional small subunits may be present but 
difficult to visualize, or may have run off the gel. E. coli y,8 shows a mixture of the a, y, 
and 5 subunits of DNA polymerase III holoenzyme (they are labeled to the right in the 
figure). Fig. 14B shows the Western results of an SDS gel of the MonoQ fractions 

25 probed with rabbit antiserum raised against the E, coli a subunit. Load and FT are as 
described in Panel A. Fraction numbers are shown above the gel. The band that 
comigrates with E. coli a, and the band in the Coomassie Blue stained gel in Panel A, is 
marked with an arrow. This band was analyzed for microsequence and the results are 
shown in Fig. 15. 

30 FIGURES 15A-B show the alignments of the peptides obtained from T.th. 

a subunit, TTHl (shown in A) and TTH2 (shown in B) with the amino acid sequences of 
the a subunits of other organisms. The amino acid number of these regions within each 
respective protein sequence are shown to the right. The abbreviations of the organisms 
are as follows. E.coli - Escherichia coli (SEQ ID NOS: 72 and 79 in 15A-B, 
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respectively), V.choL- Vibrio cholerae (SEQ ID NOS: 73 and 80 in 15A-B, respectively), 
H.inf. - Haemophilus influenzae (SEQ ID NOS: 74 and 81 in 15A-B, respectively), 
R.prow, - Rickettsia prowazekii (SEQ ID NOS: 75 and 82 in 15A-B, respectively), H.pyL 
- Helicobacter pylori (SEQ ID NOS: 76 and 83 in 15A-B, respectively), S,sp. - 
5 Synechocystis sp, (SEQ ID NOS: 77 and 84 in 15A-B, respectively), M.tub. - 

Mycobacterium tuberculosis (SEQ ID NOS: 78 and 85 in 15A-B, respectively), TJh. - 
Thermus thermophilus (SEQ ID NOS: 61 and 60 in 15A-B, respectively). 

FIGURES 16A-C show a nucleotide (Panels A-B, SEQ. ID. No. 86) and 
amino acid (Panel C, SEQ. ID. No. 87) sequence of the dnaE gene encoding the a 

10 subunit of DNA polymerase III replication enzyme. 

FIGURE 17 shows an alignment of the amino acid sequence of s subunits 
encoded by dnaQ of several organisms. The amino acid sequence of the Thermus 
thermophilus s subunit of dnaQ is also shown. T.th,^ Thermus thermophilus (SEQ. ID, 
No. 88); D.rad.^ Deinococcus radiodurans (SEQ. ID. No. 89); Bac.sub., Bacillus subtilis 

15 (SEQ. ID. No. 90); HAnf, Haemophilus influenzae (SEQ. ID. No. 9l)\E.c., Escherichia 
coli (SEQ. ID. No. 92); Hpyl, Helicobacter pylori (SEQ. ID. No. 93), The regions used 
to obtain the inner part of the dnaQ gene are shown in bold. The starts used for 
expression of the Tjh, 8 subunit are marked. 

FIGURES 18A-B show the nucleotide (Panel A, SEQ. ID. No. 94) and 

20 amino acid (Panel B, SEQ. ID. No. 95) sequence of the dnaQ gene encoding the s 
subunit of DNA polymerase III replication enzyme. 

FIGURES 19A-B show an alignment of the DnaA protein of several 
organisms. The amino acid sequence of the Thermus thermophilus DnaA protein is also 
shown. P.mar.^ Pseudomonas marcesans (SEQ. ID. No. 96); Syn.sp., Synechocystis sp. 

25 (SEQ. ID. No. 97); Bac.sub,, Bacillus subtilis (SEQ. ID. No. 98); M. tub; Mycobacterium 
tuberculosis (SEQ. ID. No. 99); T.th., Thermus thermophilus (SEQ. ID. No. 100); 
E.colL, Escherichia coli (SEQ. ID. No. 101); T. war., Thermatoga maritima (SEQ. ID. 
No. 102); and HpyL, Helicobacter pylori (SEQ. ID. No. 103). 

FIGURES 20A-B show the nucleotide (Panel A, SEQ. ID. No. 104) and 

30 amino acid (Panel B, SEQ. ID. No. 105) sequence of the dnaA gene of Thermus 
thermophilus. 

FIGURES 21 A-B show the nucleotide (Panel A, SEQ. ID. No. 106) and 
amino acid (Panel B, SEQ. ID. No. 107) sequence of the dnaNgonc encoding the P 
subunit of DNA polymerase III replication enzyme. 
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FIGURES 22A-B show an alignment of the B subunit of T.th to the B 
subunits of other organisms. TJhyThermus thermophilus (SEQ, ID. No. 108); E. 
colU Escherichia coli (SEQ, ID. No. 109); P. mirab, Proteus mirabilis (SEQ. ID. 
No, 1 1 0); K infl, Haemophilus influenzae (SEQ. ID. No. 1 1 1); P. put, Pseudomonas 
5 putida (SEQ. ID. No. 1 1 2); and B, cap. , Buchnera aphidicola (SEQ. ID. No. 113). 

FIGURE 23 is a map of the pET24:dnaN plasmid. The functional 
regions of the plasmid are indicated by arrows and italic, restriction sites are marked 
with bars and symbols. The hatched parts in the plasmid correspond to Tth, dnoN. 
FIGURES 24A-B show the induction of T,th. B in K coli cells 
10 harboring the T.r/i. B expression vector. Panel A is the cell induction. The first lane 
shows molecular weight markers (MW). The second lane shows uninduced E. coli 
cells, and the third lane shows induced E, coli. The induced Tth. B is indicated by the 
arrow shown to the left. Induced cells were lysed then treated with heat and the 
soluble portion was chrpmatographed on MonoQ. Panel B shows the results of 
15 MonoQ purification of T.r/i. B. 

FIGURE 25A is a schematic depiction of the use of the use of the 
enzymes of the present invention in accordance , with an alternate embodiment hereof. 
In this scheme the clamp (B or PCNA) slides over the end of linear DNA to enhance 
the polymerase (Pol Ill-type such as Pol III, PolB or Pol5.) In this fashion the clamp 
20 loader activity is not needed. 

FIGURE 25B graphically demonstrates the results of the practice of 
the alternate embodiment of the invention described and set forth in Example 1 5, 
infra, Lane 1 , E. coli Pol III without B; Lane 2, E. coli with B; Lane 3, human Pol8 
without PCNA; Lane 4, human Pol5 with PCNA; Lane 5, TJh. Pol III without T.th. B; 
25 Lane 6, T.th. Pol III withT.^/i. B. The respective pmol synthesis in lanes 1-6 are: 6, 
35, 2, 24, 0.6 and 1.9. 

FIGURES 26A-B show the use of Tth. Pol III in extending singly 
primed M13mpl8 to an RFII form. The scheme in Fig. 26A shows the primed 
template in which a DNA 57mer was annealled to the M13mpl 8 ssDNA circle. Then 
30 T.th. B subunit (produced recombinantly) and T.th. Pol III were added to the DNA in 
the presence of radioactive nucleoside triphosphates. In Fig. 26B, the products of the 
reaction were analyzed in a 0.8% native agarose gel. The position of ssDNA starting 
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material, the RFII product, and of intermediate species, are shown to the sides of the 
gel. Lane 1 , use of Pol III. Lane 2, use of the non-Pol III DNA polymerase. 

FIGURE 27 is an SDS poly acrylamide gel of the proteins of the A 
aeo/icM5 replication machinery. 
5 FIGURE 28 is an SDS polyacrylamide gel analysis of the MonoQ 

fractions of the method used to reconstitute and purify the A. aeolicus t66* complex. 

FIGURE 29 is an SDS polyacrylamide gel analysis of the gel filtration 
colunm fractions used in the preparation of tiie A. aeolicus axSS' complex. The 
bottom ^el analysis shows the profile obtained using the ^4. aeolicus a subimit 
10 (polymerase) in the absence of the other subunits. 

FIGURE 30 is an alkaline agarose gel analysis of reaction products for 
extension of a single primer aroxind a 7.2 kb M13mpl8 circular ssDNA genome that 
has been coated with A, aeolicus SSB. The time course on the left are produced by 
axSSVp, and the time course on the right is produced by aT65' in the absence of p. 
15 FIGURE 3 1 is a graph illustrating the optimal temperature for activity 

of the alpha subunit of Thermus replicase using a calf thymus DNA replication assay. . 
Reactions were shifted to the indicated temperature for 5 minutes before detecting the 
level of DNA synthesis activity. 

FIGURE 32 is a graph illustrating the optimal temperature for activity 
20 of the alpha subunit of the Aquifex replicase using a calf thymus DNA replication 
assay. Reactions were shifted to the indicated temperature for 5 minutes before 
detecting the level of DNA synthesis activity. 

FIGURES 33A-E illustrate the heat stability of Aquifex components. 
Assays of either a (Fig. 33 A), p (Fig. 33B), x55Vcomplex (Fig. 33C), SSB (Fig. 33D) 
25 and aT56' complex (Fig. 33E) were performed after heating samples at the indicated 
temperatures. Components were heated in buffer containing the follo\ving: 0.1% 
Triton X-1 00 (filled diamonds); 0.05% Tween-20 and 0.01 % NP-40 (filled circles); 4 
mM (ZaCb (filled triangles); 40% Glycerol {inverted filled triangles); 0.01% Triton X- 
100, 0.05% Tween.20, 0.01% NP-40, 4 mM CaCl2 (half-filled square); 40% Glycerol, 
30 0.1% Triton X-100 (open diamonds); 40% Glycerol, 0.05% Tween-20, 0.01% NP-40 
(open circles); 40% Glycerol, 4 mM CaCh (open triangles); 40% Glycerol, 0.01% 
Triton X-100, 0.05% Tween-20, 0.01% NP-40, 4 mM CaCb (half-filled diamonds). 



FIGURES 34A-B show the nucleotide sequence (SEQ. ID.^^N^^ 
the dnaE gene of ^. aeolicus. 

FIGURE 35 shows the amino acid sequence (SEQ. ID. No. 118) of the 
a subunit of i4. fleo/icus. 

FIGURE 36 shows the nucleotide sequence (SEQ. ID. No. 1 19) of the 
dnaXgeneof A, aeolicus. 

FIGURE 37 shows the amino acid sequence (SEQ. ID. No, 120) of the 
tansubvaatof A. aeolicus. 

FIGURE 38 shows the nucleotide sequence (SEQ. ID. No. 121) of the 
dnaN genQ of A, aeolicus. 

FIGURE 39 shows the amino acid sequence (SEQ. ID. No. 122) of the 
P subunitofA aeo//c:w5. 

FIGURE 40 shows the partial nucleotide sequence (SEQ. ID. No. 123) 
of ihcholA %Qn^ of A, aeolicus. 

FIGURE 41 shows the partial amino acide sequence (SEQ. ID. 
No. 124) of the 8 subunit of A aeo/fcw^. 

FIGURE 42 shows the nucleotide sequence (SEQ. ID. No. 125) of the 
holB gene of A. aeolicus. 

FIGURE 43 shows the amino acid sequence (SEQ. ID. No. 1 26) of the 
8^ sxxhvani of A aeolicus. 

FIGURE 44 shows the nucleotide sequence (SEQ. ID. No. 127) of the 
dnaQ of A. aeolicus. 

FIGURE 45 shows the amino acid sequence (SEQ. ID. No. 128) of the 

e subunit of A, aeolicus, 

FIGURE 46 shows the nucleotide sequence (SEQ. ID. No. 129) of the 
ssb gene of A, aeolicus. 

FIGURE 47 shows the amino acid sequence (SEQ. ID. No. 130) of the 
single-strand binding protein of A. aeolicus. 

FIGURE 48 shows the nucleotide sequence (SEQ. ID. No. 131) of the 
dnaB gene of A. aeolicus. 

FIGURE 49 shows the amino acid sequence (SEQ. ID. No. 132) of the 
DnaB helicase of ^4. aeolicus. 
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FIGURE 50 shows the nucleotide sequence (SEQ. ID. No. 133) of the 
dnaG %^n^ of A, aeolicus, 

FIGURE 5 1 shows the amino acid sequence (SEQ. ID. No. 1 34) of the 
DiMiG priinase of A fleo/fcw5. 

FIGURE 52 shows the nucleotide sequence (SEQ. ID. No. 135) of the 
dnaC gene of A. geolicus. 

FIGURE 53 shows the amino acid sequence (SEQ. ID. No. 136) of the 

D|naC protein of A. aeolictds, 

. FIGURE 54A-B shows the nucleotide sequence (SEQ. ID. No. 137) of 

the dnaE gene of T. maritima. 

FIGURE 55 shows the amino acid sequence (SEQ. ID. No. 138) of the 

a subunit of r. /nar/Yz/wa. 

FIGURE 56 shows the nucleotide sequence (SEQ. ID. No. 139) of the 

dnaQ gene of T. maritima, 

FIGURE 57 shows the amino acid sequence (SEQ. ID. No. 140) of the 

e subunit of T. maritima. 

FIGURE 58 shows the nucleotide sequence (SEQ. ID. No. 141) of the 

dnaX gene of T maritima. 

FIGURE 59 shows the amino acid sequence (SEQ. ID. No. 142) of the 

tau subimit of T. maritima. 

FIGURE 60 shows the nucleotide sequence (SEQ. ID. No. 143) of the 

dnaN gene of T. maritima. 

FIGURE 61 shows the amino acid sequence (SEQ. ID. No, l44) of the 

P subunit of T. maritima, 

FIGURE 62 shows the nucleotide sequence (SEQ. ID. No. 145) of the 

holA gene of T, maritima. 

FIGURE 63 shows the amino acid sequence (SEQ. ID. No. 146) of the 

5 subunit of r, marzYfma. 

FIGURE 64 shows the nucleotide sequence (SEQ. ID. No. 1 47) of the 
hoIB gene of T. maritima. 

FIGURE 65 shows the amino acid sequence (SEQ. ID. No. 148) of the 

5' subunit of T. maritima. 
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FIGUIIE 66 shows the nucleotide sequence (SEQ. ID. No. 149) of the 
ssb gem of T: maritima. 

FIGURE 67 shows the amino acid sequence (SEQ. ID. No. 1 50) of the 
single^sti^d binding protein of r. mar/fima. 
5 FIGURE 68 shows the nucleotide sequence (SEQ. ID. No. 151) of the 

dnaB gene of T. maritima. 

FIGURE 69 shows the amino acid sequence (SEQ. ID. No. 1 52) of the 

ItaaB helicase of r. inariY/ma. 

FIGURE 70 shows the nucleotide sequence (SEQ. ID. No. 153) of the 

10 dnaG%ene ofT. maritima. 

FIGURE 71 shows the amino acid sequence (SEQ. ID. No. 154) of the 

DiiaG primase of r. /narzYz/nfl. 

FIGURE 72 shows the nucleotide sequence (SEQ. ID. No. 155) of the 

holB gene of X thermophilus, 
15 FIGURE 73 shows the amino acid sequence (SEQ. ID. No. 156) of the 

svhmixi of T. thermophilus. 

FIGURE 74 shows the nucleotide sequence (SEQ. ID. No. 157) of the 
holA gene of ihermophilus. 

FIGURE 75 shows the amino acid sequence (SEQ. ID. No. 158) of the 

20 8 subunit of r. //termcip/ii/us'. 

FIGURE 76 shows the nucleotide sequence (SEQ. ID. No. 171) of the 
ssbgeneofT.thermophilus. 

FIGURE 77 shows the amino acid sequence (SEQ. ID. No. 1 72) of the 
single-strand binding protein of T. thermophilus. 
25 FIGURE 78 shows the partial nucleotide sequence (SEQ. ID. No. 173) 

of ihednoN ^ene of B. stearothermophilus. 

FIGURE 79 shows the partial amino acid sequence (SEQ. ID. No. 1 74) 
of the P subunit of B. stearGikermopkilus. 

FIGURE 80 shows the nucleotide sequence (SEQ. ID. No. 175) of the 
3 0 ssb gene of B. stearothermophilus. 

FIGURE 81 shows the amino acid sequence (SEQ. ID. No. 176) of the 
single-strand binding protein of B. stearothermophilus. 



.22- 



FIGURE 82 shows the nucleotide sequence (SEQ. ID. No. 177) of the 
hoLA gene of B, stearothermophilus. 

FIGURE $3 shows the aniino acid sequence (SEQ. ID. No. 1 

5 subunit ofB. stearothermophilus. 
5 FIGURE 84 shows the nucleotide sequence (SEQ. ID. No. 179) of the 

hoIB gene of B. stearothermophiltis. 

FIGURE 85 shows the amino acid sequence (SEQ. ID. No. 180) of the 
suhunii of B. stearothermophilus, 

FIGURES 86A-B show the partial nucleotide sequence (SEQ. ID. 
10 lio. ISl) of ihednaX^ gene of B, stearothermophiltis. 

FIGURE 87 shows the partial amino acid sequence (SEQ. ID. No. 1 82) 
of the tau subunit of 5 .yrearor/iermop/i//w5. 

FIGURES 88 A-B show the nucleotide sequence (SEQ. ID. No. 1 83) of 
the polC gene of B. stearothermophiltis, 
15 FIGURE 89 shows the amino acid sequence (SEQ. ID. No. 184) of the 

Pole or a-large subunit of A 5rearo//iermo/?W/Mj. 

DETAILED DESCRIPTION OF THE INVENTION 

20 In accordance with the present invention there may be employed 

conventional molecular biology, microbiology, and recombinant DNA techniques 
within the skill of the art. Such techniques are explained fully in the literature. See, 
e.g., Sambrook et al., ^*Molecular Cloning: A Laboratory Manual" (1 989); "Cunent 
Protocols in Molecular Biology" Volumes I-III (Ausubel, R. M., ed.) (1994); "Cell 

25 Biology: A Laboratory Handbook" Volumes I-III (Celis, J.E., ed.) (1994); "Current 
Protocols in Immunology" Volumes HII (Coligan, J.E., ed.) (1994); "Oligonucleotide 
Synthesis" (M.J. Gait, ed.) (1984); "Nucleic Acid Hybridization" (B.D. Hames & 
S.J. Higgins, eds.) (1985); "Transcription And Translation" (B.D. Hames & S.J. 
Higgins, eds.) (1 984); "Animal Cell Culture" (R.I. Freshney, ed.) (1 986); 

30 "Immobilized Cells And Enzymes" (IRL Press) (1986); B. Perbal, "A Practical Guide 
To Molecular Cloning" (1984), each of which is hereby incorporated by reference. 

Therefore, if appearing herein, the following terms shall have the 
definitions set out below. 
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The tenns "DNA Polymerase III/' 'Tolymerase Ill-type enzy^ 
"Polymerase III enzyme complex(s)", 'T-f/z. DNA Polymerase III", "yi.ae. DNA 
Polymerase III'*, "r,ma.DNA Polymerase III", and any variants riot specifically listed, 
may be used herein interchangeably, as are p subimit and sliding clamp and clamp as 
are also y complex, clamp loader, and RFC, as used throughout the present application 
and claims refer to proteinaceous material including single or multiple proteins, and 
extends to those proteins having the amino acid sequence data described herein and 
presented in the Figures and corresponding Sequence Listing entries, and the 
corresponding profile of activities set forth herein and in the Claims. Accordingly, 
proteins displaying substantially equivalent or altered activity are likewise 
contemplated. These modifications may be deliberate, for example, such as 
modifications obtained through site-directed mutagenesis, or may be accidental, such 
as those obtained through mutations in hosts that are producers of the complex or its 
named subunitis. Also, the terms "DNA Polymerase III," "r.r/i. DNA Polymerase III," 
and "y and x subunits", "B subunit", "a subunit", "s subunit", "S subunit", "6' 
subunit", "SSB protein", "sliding clamp" and "clamp loader" are intended to include 
vsdthin their scope proteins specifically recited herein as well as all substantially 
homologous analogs and allelic variations. As used herein y complex refers to a 
particular type of clamp loader that includes a y subvinit. 

Also as used herein, the term "thermolabile enzyme" refers to a DNA 
polymerase which is not resistant to inactivation by heat. For example, T5 DNA 
polymerase, the activity of which is totally inactivated by exposing the enzyme to a 
temperature of 90°C for 30 seconds, is considered to be a thermolabile DNA 
polymerase. As used herein, a thermolabile DNA polymerase is less resistant to heat 
inactivation than in a thermostable DNA polymerase. A thermolabile DNA 
polymerase typically will also have a lower optimum temperature than a thermostable 
DNA polyriierase. Thermolabile DNA polymerases are typically isolated fi-om 
mesophilic organisms, for example mesophilic bacteria or eukaryotes, including 
certain animals. 

As used herein, the term "themiostable enzyme" refers to an enzyme 
which is stable to heat and is heat resistant and catalyzes (facilitates) combination of 
. the nucleotides in the proper manner to form the primer extension products that are 
complementary to each nucleic acid strand. Generally, the synthesis will be initiated 
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strand, until syndesis terminates, producing molecules of different lengths. 

The thermostable enzyme herein must satisfy a single criterion to be 
effective for the amplification reaction, i.e., the enzyme must not become irreversibly 
denatured (inactivated) when subjected to the elevated temperatures for the time 
necessary to effect denaturation of double-stranded nucleic acids. Irreversible 
denaturation for purposes herein refers to permanent and complete loss of enzymatic 
activity. The heating conditions necessary for denaturation will depend, e.g., on the 
buffer salt concentration and the length and nucleotide composition of the nucleic 
acids being denatured, but typically range from about 90G° to about 96^G for a tinae 
depending mainly on the temperature and the nucleic acid lengtii, typically about 0.5 
to four minutes. Higher temperatures may be tolerated as the buffer salt concentration 
and/or GC composition of the nucleic acid is increased. Preferably, the enzyme will 
not become irreversibly denatured at about 90°- 100*^C. 

The theimostable enzymes herein preferably have an optinium . 
temperature at which they function that is higher than about 40^*0, which is the 
temperature below which hybridization of primer to template is pronaoted, although, 
depending on (1) magnesium and salt concentrations and (2) composition and length 
of primer, hybridization can occur at higher temperature (e.g., 45**-70*'C). Tlje higher 
the temperature optimum for the enzyme, the greater the specificity and/or selectivity, 
of the primer-directed extension process. However, enzymes that are active below 
40''C, e.g., at are also within the scope of this invention provided they are heat- 
stable. Preferably, the optimum temperature ranges from about SO"" to about 90°C, 
more preferably about 60° to about 80°C. In this connection, the term "elevated 
temperature" as used herein is intended to cover sustained temperatxires of operation 
of the enzyme that iare equal to or higher than about 60°C. 

The term "template" as used herein refers to a double-stranded or 
single-stranded DNA molecule which is to be amplified, synthesized, or sequenced. 
In the case of a double-stranded DNA molecule, denaturation of its strands to form a 
first and a second strand is performed before these molecules may be amplified, 
synthesized or sequenced. A primer, complementary to a portion of a DNA template 
is hybridized under appropriate conditions and the DNA polymerase of the invention 
may then synthesize a DNA molecule complementary to said template or a portion 
thereof The newly synthesized DNA molecule, according to the invention, may be 



equal or shorter in length than the original DNA template. Mismatch incorporation 
during the synthesis or extension of the newly synthesized DNA molecule may result 
in one or a nvimber of mismatched base pairs. Thus, the synthesized DNA molecule 
need not be exactly complementary to the DNA template. 

The term "incorporating" as used herein means becoming a part of a 
DNA molecule or primer. 

As used herein "amplification" refers to any in vitro method for 
increasing the number of copies of a nucleotide sequence, or its complimentary 
sequence, with the use of a DNA polymerase. Nucleic acid amplification results in 
the incorporation of nucleotides into a DNA molecule or primer thereby forming a 
new DNA molecule complementary to a DNA template. The formed DNA molecule 
and its template can be used as templates to synthesize additional DNA molecides. 
As used herein, one amplification reaction may consist of many rounds of DNA 
replication. DNA amplification reactions include, for example, polymerase chain 
reactions (PGR). One PGR reaction may consist of about 20 to 1 00 "cycles" of 
denaturation and synthesis of a DNA molecule. In this connection, the use of the term 
"long stretches of DNA" as it refers to the extension of primer along DNA is intended 
to cover such extensions of an average length exceeding 7 kilobases. Naturally, such 
length will vary, and all such variations are considered to be included within the scope 
of the invention. 

As used herein, the term "holoenzyme" refers to a multi-subimit DNA 
polymerase activity comprising and resulting from various subunits which each may 
have distinct activities but which when contained in an enzyme reaction operate to 
carry out the function of the polymerase (typically DNA synthesis) and enhance its 
activity over use of the DNA polymerase subunii alone. For example, E. coli DNA 
polymerase III is a holoenzyme comprising three components of one or more subunits 
each: (1) a core component consisting of a heterotrimer of a, e and 9 subimits; (2) a P 
component consisting of a p subunit dimer; and (3) a y complex component consisting 
of a heteropentamer of y, 5, h\ % and v|/ subunits (see Studwell and O'Donnell, 1990). 
These three components, and the various subunits of which they consist, are linked 
non-covalently to form the DNA polymerase III holoenzyme complex. However, 
they also function when not linked in solution. 
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As used herein/'eiizyme complex'' refers to a protein stractiff 
consisting essentially of two or more subunits of a replication enzyme, which may or 
may not be identical, noncovalently linked to each other to form a multi-subunit 
structure. An enzyme complex according to this definition ideally will have a 

5 particular enzymatic activity, up to and including the activity of the replication 
enzyme. For example, a "DNA pol III enzyme complex" as used herein means a 
multi-subunit protein activity comprising two or more of the subimits of the pNA pol 
in replication enzynie as defined above, and having DNA polymerizing or 
synthesizing activity. Thus, this term encompasses the native replication enzyme, as 

10 well as an enzyme complex lacking one or more of the subunits of the replication 
enzyme (e.g., DNA pol III exo-, which lacks the e subunit). 

The amino acid residues described herein are preferred to be in the "L" 
isomeric form. However, residues in the "D" isomeric form can be substituted for any 
L-amino acid residue, as long as the desired functional property of inMnunoglobulin- 

15 binding is retained by the polypeptide. NH2 refers to the free amino group present at 
the amino terminus of a polypeptide. COOH refers to the free carboxy group present 
at the carboxy terminus of a polypeptide. In keeping with standard polypeptide 
nomenclature, J. BioL Chem , 243:3552-59 (1969), abbreviations for amino acid 
residues are shown in the following Table of Correspondence: 

20 

TABLE OF CORRESPONDENCE 



SYMBOLS 




AMINO ACID 


1 -Letter 


3-Letter 




Y 


Tyr 


tyrosine 


G 


Gly 


glycine 


•.F. 


Phe 


phenylalanine 


M 


Met 


methionine 


A 


Ala 


alanine 


s ■ 


Ser 


serine 


I 


He 


isoleucine 


L 


Leu 


leucine 


T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 


Q 


Gin 


glutamine 


E 


Glu 


glutamic acid 


W 


Trp 


tryptophan 



R Arg 

D Asp 

N Asn 

C Cys 

It shovild be noted that all amino-acid residue sequences are represented herein by 
formulae whose left and right orientation is in the conventional direction of aminp- 
terminus.to carboxy-terminus. Furthermore, it should be noted that a dash at the 
beginning or end of an amino acid residue sequence indicates a peptide bond to a 
further sequence of one or more amino-acid residues. The above Table is presented to 
correlate the three-letter and one-letter notations which may appear alternately herein. 

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) 
that functions as.an autonomous unit of DNA replication in vivo; i.e., capable of 
replication trader its own control. 

A "vector" is a replicon, such as plasmid, phage or cosmid, to which 
another DNA segmeiit may be attached so as to bring about the replication of the 

attached segment. 

A "DNA molecule" refers to the polymeric form of 
deoxyribonucleotides (adenine, guanine, thymine, or cytosine) in its either single . 
stranded form, or a double-stranded helix, this term refers only to the primary and 
secondary structure of the molecule, and does not limit it to any particular tertiary 
forms. Thus, this temi includes double-stranded DNA found, inter alia, in linear 
DNA molecules (e.g., restriction fragments), viruses, plasmids, and chromosomes. In 
discussing the structure of particular double-stranded DNA molecules, sequences may 
be described herein according to the normal convention of giving only the sequence in 
the 5' to 3' direction along the nontranscribed strand of DNA (i.e., the strand having a 
sequence homologous to the mRNA). 

An "origin of replication" refers to those DNA sequences that 

participate in DNA synthesis. 

A DNA "coding sequence" is a double-stranded DNA sequence which 
is transcribed and translated into a polypeptide in vivo when placed under the control 
of appropriate regulatory sequences. The boundaries of the coding sequence are 
detemiined by a start codon at the 5' (amino) terminus and a translation stop codon at 
the 3' (carboxyl) temiinus. A coding sequence can include, but is not limited to, 
prokaryotic sequences, cDNA from eukaryotic mRNA, genomic dUa sequences 



arginine 
aspartic acid 
asparagine 
cysteine 



from eukaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences. A 
polyadenylation signal and transcription termination sequence vAll usually be located 

3' to the coding sequence. 

Transcriptional and translational control sequences are DNA 

regulatory sequences, such as promoters, enhancers, polyadenylation signals, 
terminators, and the like, that provide for the expression of a coding sequence in a 
host cell. 

A "promoter sequence" is a DNA regulatoiy region capable of binding 
RNA polymerase in a cell and initiating transcription of a downstream (3- direction) 
coding sequence. For purposes of defining the present invention, the promoter 
sequence is bounded at its 3 V terminus by the transcription initiation site 
upstream (5' direction) to include the miriimuni number of bases or elements 
necessary to initiate transcription at levels detectable above background. Within the 
promoter sequence will be found a transcription initiation site (conveniently defined 
by mapping with nuclease SI), as well as protein binding domains (consensus 
sequences) responsible for the binding of RNA polymerase. Eukaryotic promoters 
vsdll often, but not always, contain "TATA" boxes and "CAT" boxes. Prokaryotic 
promoters contain Shine-Dalgamo sequences in addition to the -10 and -35 consensus 
sequences. 

Ah "expression control sequence" is a DNA sequence that controls and 
regxilates the traiiscription and translation of another DNA sequence. A coding 
sequence is '*uhder the control" of transcriptional and translational control sequences 
in a cell when RNA polymerase transcribes the coding sequence into mRNA, which is 
then translated into the protein encoded by the codmg sequence. 

A "signal sequence" can be included before the coding sequence. This 
sequence encodes a signal peptide, N-terminal to the polypeptide, that communicates 
to the host cell to direct the polypeptide to the cell surface or secrete the polypeptide 
into the media, and this signal peptide is clipped off by the host cell before the protein 
leaves the cell. Signal sequences can be found associated with a variety of proteins 
native to prokaryotes and eukaryotes. 

The term "oligonucleotide," as used generally herein, such as in 
referring to probes prepared and used in the present invention, is defined as a 
molecule comprised of two or more (deoxy)ribonucleotides, preferably more than 
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three. Its exact size will depend upon many factors which, in turn, depend upon the 
ultimate function and use of the oligonucleotide. 

The terai "piimer" as used herein refers to an oligonucleotide, whether 
occurring naturally as in a purified restriction digest or produced synthetically, which 
is capable of acting as a point of initiation of synthesis when placed under conditions 
in which synthiesis of a primer extension product, which is complementary to a 
nucleic acid strand, is induced, i.e., in the presence of nucleotides and an inducing 
agent such as a DNA polymerase and at a suitable temperature and pH. The primer 
may be either single-stranded or double-stranded and must be sufficiently long to 
prime the synthesis of the desired extension product in the presence of the inducing 
agent. The exact length of the primer will depend upon many factors, including 
temperature, source of primer and use of the method. For example, for diagnostic 
applications, depending on the complexity of the target sequence, the oligonucleotide 
primer typically contains 15-25 or more nucleotides, although it may contain fewer 
nucleotides. 

The primers herein are selected to be "substantially" complementary to 
different strands of a particular target DNA sequence. This means that the primers 
must be sufficiently complementary to hybridize with their respective strands. 
Therefore, the primer sequence need not reflect the exact sequence of the template. 
For example, a non-complementary nucleotide firagment may be attached to the 5' end 
of the primer, with the remainder of the primer sequence being complementary to the 
strand. Alternatively, non-complementary bases or longer sequences can be 
interspersed into the primer, provided that the prinier sequence has sufficient 
complementarity with the sequence of the strand to hybridize therewith and thereby 
form the template for the synthesis of the extension product. 

As used herein, the terms "restriction endonucleases" and "restriction 
enzymes" refer to bacterial enzymes, each of which cut double-stranded DNA at or 
near a specific nucleotide sequence . 

A cell has been "transformed" by exogenous or heterologous DNA 
when such DNA has been introduced inside the cell. The transforming DNA may or 
may not be integrated (covalently linked) into chromosomal DNA making up the 
genome of the cell. In prokaryotes, yeast, and mammalian cells for example, the 
transforming DNA may be maintained on an episomal element such as a plasmid. 
With respect to eukaryotic cells, a stably transformed cell is one in which the 



transforming DNA has become integrated into a chromosome so that it is inherited by 
daughter cells through chromosome replication. This stability is demonstrated by the 
ability of the eukaryotic cell to establish cell lines or clones comprised of a population 
of daughter cells containing the transforming DNA. A "clone" is a population of cells 
derived from a single cell or common ancestor by mitosis. A "cell line" is a clone of 
a primary cell that is capable of stable growth in vitro for many generations. 

. Two DNA sequences are "substantially homologous" when at least 
about 75% (preferably at least about 80%, and most preferably at least about 90 or 
95%) of the nucleotides match over the defined length of the DNA sequences. 
Sequences that are substantially homologous can be identified by comparing the 
sequences using standard software available in sequence data banks, or in a Southern 
hybridization experiment under, for example, stringent conditions as defined for that 
particular system. Suitable conditions include those characterized by a hybridization 
buffer comprising 0.9M sodium citrate ("SSC") bxififer at a temperature of about 37**C 
and washing in SSC buffer at a temperature of about 37''C; and prefeiiably in a 
hybridization buffer comprising 20% formamide in 0.9M SSC buffer at a temperature 
ofabout 42°C and washing with 0.2x SSC buffer at about 42^C. Stringency 
conditions can be further varied by modifying the temperature and/or salt content of 
the buffer, or by modifying the length of the hybridization probe as is known to those 
of skill in the art. Defining appropriate hybridization conditions is within the skill of . 
the art. See, e.g., Mamatis et al., 1982; Glover, 1985; Hames and Higgins, 1984. 

It should be appreciated that also within the scope of the present 
invention are degenerate DNA sequencies. By "degenerate" is meant that a different 
three-letter codon is used to specify a particular amino acid. It is well laiown in the 
art that the following codons can be used interchangeably to code for each specific 
amino acid: 



Phenylalanine (Phe or F) UUUorUUC 

Leucine (Leu or L) UUA or UUG or CUU or CUC or CUA or CUG 

Isoleucine (lie or I) AUU or AUC or AUA 

Methionine (Met or M) AUG 

Valine (Val or V) GUU or GUC of GU A or GUG 

Serine (Ser or S) UCU or UCC or UCA or UCG or AGU or AGC 
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Proline (Pro or P) 


ecu or CCC or CCA or CCG 


Threonine (Thr or T) 


ACU or ACC or ACA or ACG 


Alanine (Ala or A) 


GCU or GCG or GCA or GCG 


Tyrosine (Tyr or Y) 


UAUorUAC . 


Histidine (His or H) 


CAU or GAC 


Glutamine (Gln.Or Q) 


CAAorCAG 


Aspara^e (Asn or N) 


AAUbrAAC 


Lysine (Lys or K) 


AAAorAAG 


Aspartic Acid (Asp or D) 


GAUorGAC 


Glutamic Acid (Glu or E) 


GAAorGAG 


Cysteine (Gys or C) 


UGUorUGC 


Arginine (Arg or R) 


CGU or CGC or CGA or CGG or AGA or AGG 


Glycine (Gly or G) 


GGU or GGC or GGA or GGG 


Tryptophan (Tip or W) 


UGG 


Termination codon 


UAA (ochre) or UAG (amber) or UGA (opal) 



It should be understood that the codons specified above are for RNA sequences. The 
corresponding codons for DNA have a T substituted for U. 

Mutations can be made, e.g., in SEQ. ID. No. 1 , or any of the nucleic 

20 acids set forth herein, such that a particular codon is changed to a codon which codes 
for a different amino acid. Such a mutation is generally made by making the fewest 
nucleotide changes possible. A substitution mutation of this sort can be made to 
change an amino acid in the resulting protein in a non-conservative manner (i.e., by 
changmg the codon from an amino acid belonging to a grouping of amino acids 

25 having a particular size or characteristic to an amino acid belonging to another 

grouping) or in a conservative manner (i.e., by changing the codpn from an amino 
acid belonging to a grouping of amino acids haying a particular size or characteristic 
to an amino acid belonging to the same grouping). Such a conservative change 
generally leads to less changie in the structure and function of the resulting protein. A 

30 non-conservative change is more likely to alter the structure, activity or function of 
the resulting protein. The present invention should be considered to include 
sequences containing conservative changes which do not significantly alter the 
activity or binding characteristics of the resulting protein. 



-32. 



The following is one example of various groupings of amino acids: 
Amino acids with nonnolar R groups 
. Alanine 
Valine 
5 Leucine 
Isoleucine 
Proline 
Phenylalanine 

Tryptophan • 
10 Methionine 

Amino acids with uncharged polar R groups 
Glycine 
Serine 
15 Threonine 
Cysteine 
Tyrosine 
Asparagine 
Glutamine 

20 

Amino acids with charged polar R groups (negatively charged at pH 6.0) 
Aspartic acid 
Glutamic acid 

25 Basic amino acids (positively charged at pH 6.0) 
Lysine 
Arginine 

Histidine (atpH 6.0) 

30 Amino acids with phenyl groups : 
Phenylalanine 
Tryptophan 
Tyrosine 



AnptKfer grouping may be according to molecular weight (i.e., size of R groups): 


Glytme ' 


75 


Alanine 


89 


Serine 


105 . 


ProlfiSeS 


115 




117 


Thiwnin 


119 




121 


Leuaiie*, 


131 


IsblSuiime 


131 


Asparagine 


132 


Asp^tic acid 


133 


Glutlmiiiie 


146 


Lysine 


146 


Gltitamic acid . 


147 


Me^onine 


149 , 


HistiMe(^pH6.0) 


155 


Phenylalanine 


165 


Arginihe . 


174 


Tyrosine 


181 


Tiyptpphan 


204 



Particularly preferred substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 

- Glu for Asp and vice versa such that a negative charge may be maintained; 

- Ser for Thr such that a free -OH can be maintained; and 

- Gin for Asn such that a fi^e NH2 can be maintained. 

Amino acid substitutions may also be introduced to substitute an 
amino acid with a particularly preferable property. For example, a Cys may be 
introduced into a potential site for disulfide bridges with another Cys. A His may be 
introduced as a particularly "catalytic" site (i.e.. His can act as an acid or base and is 
the most common amino acid in biochemical catalysis). Pro may be introduced 
because of its particularly planar structure, which induces p-tums in the protein's 
structure. 



Two amino acid sequences are "substantially homologous*' when at 
least about 70% of the amino acid residues (preferably at least about 80%^ and most 
preferably at least about 90 pr 95%) are identicsd, or represent conservative 
substitutions. 

A "heterologous" region of the DN A construct is an identifiable 
segment of DNA within a larger DNA molecule that is not found in association with 
the larger molecule in nature. Thus, when the heterologous region encodes a 
manmialian gene, the gene will usually be flanked by DNA that does not flank the 
mammalian genomic DNA in the genome of the source organism. Another example 
of a heterologous coding sequence is a construct where the coding sequence itself is 
not fovmd in nature (e.g., a cDNA where the genomic coding sequence contains 
introns, or synthetic sequences having codons different than the native gene). Allelic 
variations or naturally-occxming mutational events do not give rise to a heterologous 
region of DNA as defined herein. 

An "antibody" is any immunoglobulin, including antibodies and 
fi-agments thereof, that binds a specific epitope. The term encompasses polyclonal, 
monoclonal, and chimeric antibodies, the last mentioned described in fiuther detail in 
U.S. Patent Nos. 4,81 6,397 to Boss et al. and 4,8 1 6,567 to Cabilly et al. 

An "antibody combiriing site" is that structural portion of an antibody 
molecule comprised of heavy and light chain variable and hypeirvariable regions that 
specifically binds antigen. 

The phrase "antibody niolecule" in its various granimatical forms as 
used herein contemplates both an intact immunoglobulin molecule and an 
immunologically active portion of an immunoglobulin molecule. Exemplary 
antibody molecules are intact immunoglobulin molecules, substantially intact 
immunoglobulin molecules and those portions of an immunoglobulin molecule that 
contains the paratope, including those portions known in the art as Fab, Fab', F(ab')2 
and F(v), which portions are preferred for use in the therapeutic methods described 
herein. Fab and F(ab*)2 portions of antibody molecules are prepared by the proteolytic 
reaction of papain and pepsin, respectively, on substantially intact antibody molecules 
by methods that are well-known. See for example, U.S. Patent No. 4,342,566 to 
Theofilopolous et al. Fab' antibody molecule portions are also well-known and are 
produced fi-om F(ab*)2 portions followed by reduction of the disulfide bonds linking 
the two heavy chain portions as with mercaptoethanol, and followed by alkylation of 



the resxilting protein mercaptan with a reagent such as iodoaceta^ An antibody 
containing intact antibody molecvdes is preferred herein. 

The phrase "monoclonal antibody" in its various grammatical forms 
refers to an antibody having only one species of antibody combining site capable of 
immimoreacting with a particular antigen. A monoclonal antibody flius typically 
displays a single binding affinity for any antigen with which it immunoreacts. A 
monoclonal antibody may therefore contain an antibody mblecule having a plurality 
of antibody combining sites, each inmiimospecific for a different antigen; e.g., a 
bispecific (chimeric) monoclonal antibody. 

A DNA sequence is "operatively linked" to an expression control 
sequence when the expression control seqiience controls and regulates the 
transcription and translation of that DNA sequence. The term "operatively linked" 
includes having an appropriate start signal (e.g., ATG) in jfront of the DNA sequence 
to be expressied and maintaining the correct reading frame to permit expression of the 
DNA sequence imder the control of the expression control sequence and production of 
the desired product encoded by the DNA sequence. If a gene that one desires to insert 
into a recombinant DNA molecule does not contain an appropriate start signal, such a 
start signal can be inserted in front of the gene. 

The term "standard hybridization conditions'* refers to salt and 
temperature conditions substantially equivalent to 5x SSC and 65°C for both 
hybridization and wash. However, one skilled in the art will appreciate that such 
"standard hybridization conditions" are dependent on particular conditions including 
the concentration of sodium and magnesium in the buffer, nucleotide sequence length 
and concentration, percent mismatch, percent formamide, and the like. Also 
important in the determination of "standard hybridization conditions" is whether the 
two sequences hybridizing are RNA-KNA, DNA-IDNA or RNA-DNA. Such standard 
hybridization conditions are easily determined by one skilled in the iart according to 
well known formulae, wherein hybridization is typically 1 0-20°C below the predicted 
or detemiined Tn, with washes of higher stringency, if desired. 

In its primary aspect, the present invention concerns the identification 
of a class of DNA Polymerase Ill-type enzymes or complexes found in thermophilic 
bacteria such as Thermus thermophilus (TJh.), Aquifex aeolicus (A.ae.), Thermotoga 
maritima (T.ma.), Bacillus stearothermophilus (B,sL) arid other eubacteria which 
exhibit the following characteristics, among their properties: the ability to extend a 
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primer over a long stretch of ssDlSf A at elevated temperature, stimulation by its 
cognate sliding clamp of the type that is assembled on DNA by a clariip loader, 
accessory subunits that exhibit DNA-stimulated ATPase activity at elevated 
temperature and/or ionic strength, and an associated 3-5' exonuclease activity. In a 
particular aspect, the invention extends to Polymerase Ill-type enzymes derived from 
a broad class of thermophilic eubacteria that include polymerases isolated from the 
thermophilic bacteria Aquifex aedlicus {A.ae: polymerase) and other members of the 
Aquifex genus; Thermits thermophilus {TJh. polymerase), Thermusfqvus (Tfl/Tub 
polymerase), Thermus ruber (Tru polymerase), Thermus brockianus 
(DYNAZYME™ polymerase) and other members of the Thermus genus; Bacillus 
stearothermophilus (55/ polymerase) and other members of the Bacillus genus; 
Thermoplasma acidophilum (Tac polymerase) and other members of the 
Thermoplasma genus; and Thermotoga neapolitana (Tne polymwase; Sfee WO 
96/1 0640 to Chatterjee et al.), Thermotoga maritima (Tma polymerase; See U.S. 
Patent No. 5,374,553 to Gelfand et al.), and other members of the Thermotoga genus. 
The particular polymerase discussed herein by way of illustration and not limitation, 
is the enzyme derived from Tth., A.ae., Tma., or B,st, 

Polymerase Ill-type enzymes covered by tiie invention include those 
that may be prepared by purification from cellular material, as described in detail in 
the Examples infray as well as enzyme assemblies or complexes that comprise the 
combination of individually prepared enzyme subvmits or components. Accordingly, 
the entire enzyme may be prepared by purification from cellular material, or may be 
constructed! by the preparation of the individual components and their assembly into 
the fimctional enzyme. A representative and non-limitative protocol for the 
preparation of an enzyme by this latter route is set forth in U.S. Patent No. 5,583,026 
to O'Donnell, and the disclosure thereof is incorporated herein in its entirety for such 
pvirpose. 

Likewise, individiial subunits may be modified, e.g. as by 
incorporation therein of single residue substitutions to create active sites therein, for 
the purpose of imparting new or enhanced properties to enzymes containing the 
modified subxmits (see, e.g.. Tabor, 1995). Likevsdse, individual subunits prepared in 
accordance with the invention, may be used individually and for example, may be 
substituted for their coimterparts in other enzymes, to improve or particularize the 



properties of the resultant modified en2yme. Such modifications are withiri the skill 
of the art and are considered to be included within the scope of the present invention. 

Accordingly, the invention includes the varioiis subunits that may 
comprise the enzymes, and accordingly extends to the genes and corresponding 
proteins that may be encoded thereby, such as the a (as well as PolC), P, y, e, x, 5 and 
6' subunits, respectively. More particularly, in Thermus thermophilus the a subunit 
corresponds to dnaE, the B subunit corresponds to dnaN, the e subunit corresponds to 
dnaQ^ and the y and x subimits correspond to dncxX, the 5 subunit corresponds to 
holA, and the 8* subunit corresponds to holB. In Aquifex aeolicus and Thermotoga 
maritimay itic a subunit corresponds to dnaE, the P subunit corresponds to dhoAT, the e 
subxmit corresponds to dnaQ, ihe x subunit corresponds to dnaX, the 8 subimit 
corresponds to hblA, and the 8' subunit corresponds to hoIB. In Bacillus 
stearothermophiltis, the PolC which has both a and e activities corresponds to polCy 
the p subunit corresponds to dnaN, the 8 subunit corresponds to dnaQ^ the x subunit 
corresponds to dnaX, the 5 subimit corresponds to hoJA, and the 8' subunit 
corresponds to Ao/jB. 

Accordingly, the Polymerase Ill-type enzyme of the present invention 
comprises at least one gene encoding a subunit thereof, which gene is selected from 
the group consisting of dnaX, dnaQ, dnaE, dnaN, holA, holB, and combinations 
20 thereof. More particularly, the invention extends to the nucleic acid molecule 
encoding them and their encoded subunits. 

In the T.tk Pol III enzyme, this includes the following nucleotide 
sequences: dnaX(SEQAD. No. 3), dnaE (SEQ. ID. No: 86), dnaQ (SEQ. ID. No. 94), 
dnaN (SEQ. ID. No. 1 06), holA (SEQ. ID. No. 157), and holB (SEQ. ID. No. 1 55). 
25 In the A.ae. Pol III enzyme, this includes the follovwhg nucleotide 

sequences: dnaX(SEQ. ID. No. 1 19)^ dnaE (SEQ. ID. No. 1 17), dnaQ {SEQ. ID. No. 
127), dnaN iSEQAD. No. 121), holA (SEQ. ID. No. 123), and holB (SEQ. ID. No. 
125). 

In the T.ma. Pol III enzyme, this includes the following nucleotide 
30 sequences: dnaX(SEQ. ID. No. 141), dnaE (SEQ. ID. No. 137), dnaQ (SEQ. ID. No. 
139), dnaN (SEQ. ID. No. 143), holA (SEQ. ID. No. 145), and hoIB (SEQ. ID. No. 
,147). . 



In the B,sL Ppl III enzyme, this includes the foUovsdng nucleotide 
sequences: rfmzAT (SEQ. ID. No. 181), dnaN(SEQ.lD. No. 173), hoIA (SEQ. ID. No. 
177), hoIB (SEQ. ID. No. 179), and polC (SEQ. ID. Nos. 183). 

In each of the Pol III type enzymes of the present invention, not only 
are each of the above-identified coding sequences contemplated, but also conserved 
variants, active fragments and analogs thereof. 

A particular TJft Pplymerase Ill-type enzyme in accordance with the 
invention may include at least one of the following stib-units: a y subvmit having an 
amino acid sequence corresponding to SEQ. ID. Nos. 4 and 5; a x subunit having an 
amino acid sequence corresponding to SEQ. ID. No. 2; a s subuxiit having an amino 
acid sequence corresponding to SEQ. ID. No. 95; a a subxmit including m amino acid 
sequence corresponding SEQ. ID. No. 87; a p subxmit having an amino acid sequence 
corresponding to SEQ. ID. No. 107; a 5 subunit having an amino acid sequence 
corresponding to SEQ. ID. No. 1 58; a 8* subunit having an amino acid sequence 
corresponding to SEQ. ID. No. 1 56; as well as variants, including allelic variants, 
muteins, analogs and fragments of any of the subvmits, and compatible combinatioiis 
thereof, capable of functioning in DN A ampUfication and sequencing. 

A particular A.ae. Polymerase Ill-type enzyme in accordance with the 
invention may include at least one of the following sub-units: a x subunit having an 
amino acid sequence corresponding to SEQ. ID. No. 120; a 8 subunit havmg an amino 
acid sequence corresponding to SEQ. ID. No. 128; a a subunit including an amino 
acid sequence corresponding to SEQ. ID. No. 1 18; a fi subunit having an amino acid 
sequence corresponding to SEQ. ID. No. 122; a 5 subxmit having an amino acid 
sequence corresponding to SEQ. ID. No. 124; a 5' subunit having an amino acid 
sequence corresponding to SEQ. ID. No. 126; as well as variants, including allelic 
variants, muteins, analogs and fragments of any of the subunits, and compatible 
combinations thereof, capable of functioning in DNA amplification and sequencing. 

A particular T.ma. Polymerase Ill-type enzyme in accordance with the 
invention may include at least one of the following sub-units: a x subunit having an 
amino acid sequence corresponding to SEQ. ID. No. 142; a s subunit having an amino 
acid sequence corresponding to SEQ. ID. No. 1 40; a a subunit including an amino 
acid sequence corresponding to SEQ. ID. No. 1 38; a B subunit having an amino acid 
sequence corresponding to SEQ. ID. No. 144; a 5 subunit having an amino acid 
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sequence corresponding to SEQ. ID. No. 146; a 6' subunit having an amino acid 
sequence corresponding to SEQ. ID. No. 148; as well as variants, including allelic 
variants, muteins, analogs and fragments of any of the subunits, and compatible 
combinations thereof, capable of functioning in DNA amplification and sequencinig. 

A particular B.sL Polymerase Ill-typle enzyme in accordance with the 
invention may include at least one of the following subunits: a x subxmit having a 
partial amino acid sequence corresponding to SEQ. ID. No. 182; a p subunit having 
an amino acid sequence corresponding to SEQ ID. No. 174; a 5 subxmit having an 
amino acid sequence corresponding to SEQ. ID. No. 178; a 5' subunit having an 
amino acid sequence cprresponding to SEQ. ID. No. 180; a PolC subxmit having an 
amino acid sequence corresponding to SEQ. ID. Nos. 1 84; as well as variants, 
including allelic variants, muteins, anialogs and fragments of any of flie subxmits, and 
compatible combinations thereof, capable of functioning in DNA amplification and 
sequencing. 

The invention also includes and extends to the use and application of 
the enzyme and/or one or more of its components for DNA molecule amplification 
and sequencing by the methods set forth hereinabove, and in greater detail later on 
herein. 

One of the subunits of the invention is the TJh. yh subxmit encoded by 
a dnaXgcne, which frameshifts as much as -2 with high efficiency, and that, upon 
frameshiftlng, leads to the addition of more than one extra amino acid residue to the 
C-terminus (to form the y subxmit). Further, the invention likewise extends to a dnaX 
gene derived from a thermophile such as T.th., that possesses the frameshift defined 
herein and that codes for expression of the y and x subxmits of DNA Polymerase III. 

The present invention provides methods for amplifying or sequencing 
a nucleic acid niolecxile comprising contacting the nucleic acid molecule with a 
composition comprising a DNA polymerase III enzyme (DNA pol III) complex (for 
sequencing, preferably a DNA pol III complex that is substantially reduced in 3'-5' 
exonuclease activity). DNA pol III complexes used in the methods of the present 
invention are thermostable. 

The invention also provides DNA molecules amplified by the present 
methods, methods of preparing a recombinant vector comprising inserting a DNA 



molecule amplified by the present methods into a vector, which is preferably an 
expression vector, and recombinant vectors prepared by these methods. 

The invention also provides methods of preparing a recombinant host 
cell comprising inserting a DNA molecule amplified by the present methods into a 
host cell, which preferably a bacterial cell, most preferably an Escherichia coli cell; a 
yeast cell; or an animal cell, most preferably an insect cell, a nematode cell or a 
mammalian cell. The invention also provides and recombihismt host cells prepared by 
these methods. 

In additional preferred embodiments, the present invention provides 
kits for amplifying or sequencing a nucleic acid molecule. DNA amplification kits 
according to the invention comprise a carrier means having in close confinement 
therein two or more container means, wherein a first container means contains a DNA 
polymerase III enzyme complex and a second container means contains a 
deoxynucleoside triphosphate. DNA sequencing kits according to the present 
invention comprise a multi-protein Pol Ill-type enzyme complex and a second 
container means pontains a dideoxynucleoside triphosphate. The DNA pol III 
contained in the contamer means of such kits is preferably substantially reduced in 5'- 
3' exonuclease activity, may be themiostable, and may be isolated fi:om the 
Ibemiophilic cellvilar sources described above. 

DNA pol Ill-type enzyme complexes for use in the present invention 
may be isolated fi-om any organism that produced the DNA pol Ill-type enzyme 
complexes naturally or recombinantly. Such enzyme complexes may be 
thermostable, isolated from a variety of thermophilic organisms. 

The thermostable DNA polymerase Ill-type enzymes or complexes 
that are an important aspect of this invention, may be isolated firom a variety of 
theraiophilic bacteria that are available commercially (for example, from American 
Type Culture Collection, Rockville, Maryland). Suitable for use as sources of 
thermostable enzymes are the thermophilic eubacteria Aquifex aeolicus and other 
species of the Aquifex genus; Thermus aguaticus, Thermus ihermophilus, Thermus 
flcxvus, Thermus ruber, Thermus brockianus, and other species of the Thermus genus; 
Bacillus stearothermophilus. Bacillus subtilis, and other species of the Bacillus genus; 
Therrhoplasma acidophilum and other species of the Thermoplasma genus; 
Thermotoga neapolitana, Thermotoga maritima and other species of the Thermotoga 
genus; and mutants of each of these species. It will be understood by one of ordinary 



skill in the art, however, that any thermophilic microorganism might be used as a 
SQurce of thermostable DNA pol Ill-type enzymes and polypeptides for use in the 
methods of the present invention. Bacterial cells may be grown according to standard 
microbiological techniques, using culture media and incubation conditions suitable for 
growing active cultures of the particular thermophilic species that are well-known to 
one of ordinary skill in the art (see, e.g., Brock et al., 1969; Oshima et al., 1974). 
Thermostable DNA pol III complexes may then be isolated from such thermophilic 
cellular sources as described for thermolabile complexes above. 

Several methods are available for identifying homologous nucleic 
acids and protein subunits in other thermophilic eubacteria, either those listed above 
or otherwise. These methods include the following: 

(1) The following procedure was used to obtain the genes encoding 
TJh. e idnaQ), xly (dnaX), DnaA (dnoA), and p (dnaN). Protein sequences encoded 
by genes of non-thermpphilic bacteria (i.e., mesbphiles) are aligned to identify highly 
conserved amino acid sequences. PGR primers at conserved positions are designed 
using the codon usage of the organism of interest to amplify an internal section of the 
gene from genomic DNA extracted from the organism. The PGR product is 
sequenced. New primers are designed near the eiids of the sequence to obtain new 
sequence that flanks the ends using circular PGR {also called inversed PGR) on 
genomic DNA that has been cut with the appropriate restriction enzyme and ligated 
into circles. These new PGR products are sequenced. The procedure is repeated xmtil 
the entire gene sequence has been obtained. Also, dnaN (encoding p) is located next 
to dnoA in bacteria and, therefore, dnaN can be obtained by cloning DNA flanking the 
dnuA gene by the circular PGR procedure starting within dnoA, Once the gene is 
obtained, it is cloned into an expfesision vector for protein production. 

(2) The following procedure was used to obtain the genes encoding 
TJh a polymerase (dnoE gene). The DNA polymerase III cian be purified directly 
from the organism of interest and amino acid sequence of the subunit(s) obtained 
directly. In the case of TJh., TJh. cells were lysed and proteins were fractionated. An 
antibody against E. coli a was used to probe column fractions by Western analysis, 
which reacted with r./Zi. a. The Tth a was transferred to a membrane, proteolyzed, 
and fragments were sequenced. The sequence was used to design PGR primers for 
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amplification of an internal section of the dnaE gene. Remaining flanking sequences 
are tfien obtained by circular PCR. 

(3) The following procedure can be used to identify published 
nucleictide sequences which have not yet been identified as to their function. This 
method was used to obtain TJh 8 (holA) and 5* QiolE), although they could 
presumably also have been obtained via Methods 1 and 2 above. Discovery of T.th 
dnaE (a), dnaN (p) and dnaX {%ly) indicates that thermophiles use a class III type of 
DNA polymerase (a) that utilize a clamp (p) and must also use a clamp loader since 
they have T/y. Also, the biochemical experiments in the Examples infra show that the 
T.th polymerase functions with the T.th. p clamp. Having demonstrated that a 
thermophile (e.g., Tth) does indeed utilize a class III type of polymerase with a 
clamp and clamp loader, it can be assumed that they may have 5 and 5* subunits 
needed to form a complex with x/y for functional clamp loading activity (i.e., as shown 
in E. colii 5 and 6' bind either t or y to form tSS' or y55' complex, both of which are 
functional claihp loaders). The 5 subunit is not very well conserved, but does give a 
match in the sequence databases for A.ae,^ T.ma^ and TJh, The TJh, database 
provided limited information on the amino acid sequence of 5 subunit, although one 
can easily obtain the complete sequence of TJh holA hy PCR and circular PCR as 
outlined above in Method 1. The^.ae. and T.ma. databases are complete and, 
therefore, the entire holA sequence from these genomes are identified. Neither 
database recognized these sequences as 5 encoded by holA. The 5' subunit (holB) is 
fairly well conserved. Again the incomplete TJh. database provided limited 5' 
sequence, but as with 5, it is a straight forward process for anyone experienced in the 
area to obtain the rest of the holB sequence using PCR and circular PCR as described 
in Method 1. Neither the yl.ae. nor Tma, databases recognized hqlB encoding 6'. 
Nevertheless, holB was identified as encoding 6* by searching the databases with S* 
sequence. In each case, the Thermatoga maritima and Aquifex aeolicus holB gene 
and 5' sequence were obtained in their entirety. Neither database had previously 
annotated holA or holB encoding 5 and 5'. 

As stated above and in accordance with the present invention, once 
nucleic acid molecules have been obtained, they may be amplified according to any of 
the literature-described manual or automated amplification methods. Such methods 
includes, but are not limited to, PCR (U.S. Patent No. 4,683,195 to Mullis et al. and 
U.S. Patent No. 4,683,202 to Mullis), Strand Displacement Amplification (SDA) 



(U.S, Patent No. 5,455, 1 66 to Walker), and Nucleic Acid Sequence-Based; 
Amplification (NASBA) (U.S. Patent No. 5,409,81 8 to Davey et al.; EP 329,822 to 
Davey et al.). Most preferably, nucleic acid molecules are amplified by the methods 
of the present invention using PCR-based amplification techniques. 

In the initial steps of each of these amplification methods, the nucleic 
acid molecule to be amplified is contacted with a composition comprising a DNA 
polymerase belonging to the evolutionary "family A" class (e.g., Taq DNA pol I or E. 
coli pol I) or the "family "B" class (e.g., Vent and PJii DNA polymerases - see Ito 
and Braithwaite, 1991). All of these DNA polymerases are present as single subimits 
and are primarily involved in DNA repair. In contrast, the DNA pol Ill-type enzymes 
are multisubimit complexes that mainly function in the replication of the 
chromosome, and the subunit containing the DNA polymerase activity is in the 
"family C" class. 

Thus, in amplifying a nucleic acid molecule according to the methods 
of the present invention, the nucleic acid molecule is contacted writh a composition 
comprising athermostable DNA pol Ill-type enzyme complex. 

Once the nucleic acid molecule to be amplified is contacted with the 
DNA pol in-type complex, the amplification reaction may proceed according to 
standard protocols for each of the abpve-described techniques. Since most of these 
techniques comprise a high-temperature denaturation step, if a thermolabile DNA pol 
Ill-type enzyme complex is used in nucleic acid amplification by any of these 
techniques the enzyme would need to be added at the start of each amplification 
cycle, since it would be heat-inactivated at the denaturation step. However, a 
thermostable DNA pol Ill-type complex used in these methods need only be added 
once at the start of the amplification (as for Ta^ DNA polymerase in traditional PGR 
amplifications), as its activity will be unaffected by the high temperatvire of the 
denaturation step. It should be noted, however, that because DNA pol Ill-type 
enzymes may have a much more rapid rate of nucleotide incorporation than the 
polymerases commonly used in these amplification techniques, the cycle times may 
need to be adjusted to shorter intervals than would be standard. 

In an alternative preferred embodiment, the inviention provides 
methods of extending primers for several kilobases, a reaction that is central to 
amplifying large nucleic acid molecules, by a technique commonly referred to as 
"long chain PGR" (Barnes, 1994; Gheng, 1994). 
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. In such a method the target primed DNA can contain a single strand 
stretch of DNA to be copied into the double strand form of several or tens of . 
kilobases. The reaction is performed in a suitable buffer, preferably Tris, at a pH of 
between 5.5 - 9.5, preferably 7.5. The reaction also contains MgCh in the range 1 
5 mM to 10 mM, preferably 8 mM, and may contain a suitable salt such as NaCl, KCl 
or sodium or potassium acetate. The reaction also contains ATP in the range of 20 

to 1 mM, preferably 0.5 mM, that is needed for the clamp loader to assemble the 
clamp onto the primed template, and a sufficient concentration of deoxynucleoside 
triphosphates in the range of 50|iM to 0.5 mM, preferably 60 (iM for chain extension: 
10 The reaction contains a sliding clamp, such as the B subunit, in the range of 20hg to 
200 ng, preferably 100 ng, for action as a clamp to stimulate the DNA polymerase. 
The chain extension reaction contains a DNA polymerase and a clamp loader, that 
could be added either separately or as a single Pol III* -like particle, preferably as a 
Pol in* like particle that contains the DNA polymerase and clamp loadmg activities. 
15 The Pol lll-type enzyme is added preferably at a concentrations of about 6.0002-200 
vinits per milliliteir, about 0.002- 1 00 units per milliliter, about 0.2-50 units per 
milliliter, and most preferably about 2-50 units per milliliter. The reaction is 
incubated at elevated temperature, preferably 60*^0 or more, and could include other 
proteins to enhance activity such as a single strand DNA binding protein. 
20 In another preferred embodiment, the invention provides methods of 

extending primers on linear templates in the absence of the clamp loader. In this 
reaction, the primers are annealled to the linear DNA, preferably at the ends such as in 
standard PGR applications. The reaction is performed in a suitable buffer, preferably 
Tris, at a pH of between 5.5 - 9.5, preferably 7,5. The reaction also contains MgCb in 
25 the range of 1 mM to 10 mM, preferably 8 mM, and may contain a suitable salt such 
as NaCl, KCl or sodium or potassium acetate. The reaction also contains a sufficient 
concentration of deoxynucleoside triphosphates in the range of 50^M to 0.5 mM, 
preferably 60 nM for chain extension. The reaction contains a sliding clamp, such as 
the B subunit, in the range of 20ng to 20 ^g, preferably about 2 jig, for ability to slide 
30 on the end of the DNA and associate with the polymerase for action as a clamp to 

stimulate the DNA polymerase. The chain extension reaction also contains a Pol Ill- 
type polymerase subunit such as a, core, or a Pol III* -like particle. The Pol Ill-type 
enzyme is added preferably at a concentrations of about 0.0002-200 units per 



milliliter, about 0.002-1 00 units per milliliter, about 0.2-50 xmits per milliliter, and 
most preferably about 2-50 units per milliliter. The reaction is incubated at elevated 
temperature, preferably 60°C or more, and could include other proteins to enhance 
activity such as a single strand DN A binding protein. 

The methods of the present invention thus will provide high-fidelity 
amplified copies of a nucleic acid molecule in a more rapid fashion than traditional 
amplification methods using the repair-type enzymes. 

These amplified nucleic acid molecules may then be manipulated 
according to standard recombinant DNA techniques. For example, a nucleic acid 
molecule amplified according to the present methods may be inserted into a vector, 
which is preferably an expression vector, to produce a recombinant vector comprising 
the amplified nucleic acid molecule. This vector may then be inserted into a host cell, 
where it may, for example, direct the host cell to produce a recombinant polypeptide 
encoded by the amplified nucleic acid molecule. Methods for inserting nucleic acid 
molecules into vectors, and inserting these vectors into host cells, are well-known to 
one of ordmary sldll in the art (see, e.g., Mahiatis, 199i2^^ 

Alternatively, the amplified nucleic acid molecules may be directly 
inserted into a host cell, where it niay be incorporated into the host cell genome or 
may exist as an extrachromosomal nucleic acid molecule, thereby producing a 
recombinant host cell. Methods for introduction of a nucleic acid molecule into a host 
cell, including calcium phosphate transfection, DEAE-dextran mediated transfection, 
cationic lipid-mediated transfection, electroporation, transduction, infection or other 
methods, are described in many standard laboratory manuals (see, e.g., Davis, 1986). 

For each of the above techniquesi wherein an amplified nucleic acid 
molecule is introduced into a host cell via a vector or via direct introduction, preferred 
host cells include but are not limited to a bacterial cell, a yeast cell, or an animal cell. 
Bacterial host cells preferred in the present invention are E. coli, Bacillus spp., 
Streptomyces spp., Erwinia spp., Klebsiella spp. and Salmonella typhimurium. 
Preferred as a host cell is E. coli, and particularly preferred are E. coli strains DHIOB 
and Stbl2, which are available conunercially (Life Technologies, Inc. Gaithersburg, 
Maryland). Preferred animal host cells are insect cells, nematode cells and 
mammalian cells. Insect host cells preferred in the present invention are Drosophila 
spp. cells, Spodoptera Sf9 and Sf21 cells, and Trichoplusa High-Five cells, each of . 
which is available commercially (e.g., from Invitrogen; San Diego, California). 



Preferred nematode host cells are those derived from C. elegans, and preferred 
mammalian host cells are those derived from rodents, particularly rats, mice or 
hamsters, and primates, particularly monkeys and humans. Particularly preferred as 
mammalian host cells are CHO cells, COS cells and VERO cells. 

By the present invention, nucleic acid molecules may be sequenced 
according to any of the literature-described manual or automated sequencing methods. 
Such methods include, but are not limited to, dideoxy sequencing methods such as 
"Sanger sequencing" (Sanger and Coulson, 1975; Sanger et al., 1977; U.S. Patent No. 
4,962,022 to Fleming et al.; and U.S. Patent No. 5,498,523 to Tabor et al.), as well as 
more complex PCR-based nucleic acid fingerprinting techniques such as Random 
Amplified Polymorphic DNA (RAPD) analysis (Williams e^^ Arbitrarily . 

Primed PGR (AP-PCR) (Welsh and McClelland, 1 990), DNA Amplification 
Fingerprinting (DAF) (Gaetano-Anoll6s, 1991), microsatellite PGR or Directed 
Amplification of Minisatellite-region DN A (D AMD) (Heath et al., 1993), and 
Amplification Fragment Length Polymorphism (AFLP) analysis (EP 534,858 to Vos 

et al.; Vos et al , 1995; Lin and Kuo, 1 995). 

As described above for amplification methods, the nucleic acid 

molecule to be sequenced by these methods is typically contacted with a composition 
comprising a type A or type B DNA polymerase. By contrast, in sequencing a nucleic 
acid molecule according to the methods of the present invention, the nucleic acid 
molecule is contacted with a composition comprising a thermostable DNA pol Ill- 
type enzyme complex instead of necessarily using a DNA polymerase of the family A 
or B classes. As for amplification methods, the DNA pol Ill-type complexes used in 
the nucleic acid sequencing methods of the present invention are preferably 
substantially reduced in 3'-5' exonuclease activity; most preferable for use in the 
present methods is a DNA polymerase Ill-type complex which lacks the e subunit. 
DNA pol Ill-type complexes used for nucleic acid sequencing according to the 
present methods are used at the same preferred concentration ranges described above 
for long chain extension of primers. 

Once the nucleic acid molecule to be sequenced is contacted v^th the 
DNA pol III complex, the sequencing reactions may proceed according to the 
protocols disclosed in the above-referenced techniques. 

As discussed above, the invention extends to kits for use in nucleic 
acid amplification or sequencing utilizing DNA polymerase Ill-type enzymes 
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according to the present methods. A DNA amplification kit according to the present 
invention may comprise a carrier means, such as vials, tubes, bottles and the like. A 
first such container means may contain a DNA polymerase Ill-type enzyme complex, 
and a second such container means may contain a deoxynucleoside triphosphate. The 
amplification kit encompassed by this aspect of the present invention may further 
comprise additional reagents and compoxmds necessary for carrying out standard 
nucleic amplification protocols (See U.S. Patent No. 4,683,195 to MuUis et al. and 
U.S. Patent No. 4,683,202 to MuUis, which are directed to methods of DNA 
amplification by PGR). 

Similarly, a DNA sequencing kit according to the present invention 
comprises a carrier means having in close confinement therein two or more container 
means, such as vials, tubes, botdes and the like. A first such container means may 
contain a DNA polymerase Ill-type enzyme complex, and a second such container 
means may contain a dideoxynucleoside triphosphate. The sequencing kit may 
further comprise additional reagents and compoxmds necesisary for carrying .out 
standard nucleic sequencing protocols, such as pyrophosphatase, agarose or 
polyacrylamide media for formulating sequencing gels, and other components 
necessary for detection of sequenced nucleic acids (See U.S. Patent No. 4,962,020 to 
Fleming et al. and U.S. Patent No. 5,498,523 to Tabor et al., which are directed to 
methods of DNA sequencing). 

The DNA polymerase Ill-type complex contained in the first container 
means of the amplification and sequencing kits provided by the invention is 
preferably a thermostable DNA polymerase Ill-type enzyme complex and more 
preferably a DNA polymerase Ill-type enzyme complex that is reduced in 3-5' 
exonuclease activity; Naturally, the foregoing methods and kits are presented as 
illustrative and not restrictive of the use and application of the enzymes of the 
invention for DNA molecule amplification and sequencing. Likewise, the 
applications of specific embodiments of the enzymes, including conserved variants 
and active fragments thereof are considered to be disclosed and included within the 
scope of the invention. 

As discussed earlier, individual subunits could be modified to 
customize enzyme construction and corresponding use and activity. For example, the 
region of a that interacts with B could be subcloned onto another DNA polymerase, 
thereby causing B to enhance the activity of the recombinant polymerase. 
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AJtematively, the 13 clamp could be modified to fimction with another protein or 
enzyme thereby enhancing its activity or acting to localize its action to a particular 
targeted DNA. Finally, the polymerase active site could be modified to enhance its 
action, for example changing Tyrosine enabling more equal site stoppage vsdth the 
four ddNTPs (Tabor et al., 1 995). TTiis represents a particular non-limiting 
illustration of the scope and practice of the present invention with reference td the 
utility of individual subimits hereof 

Accordingly and as stated above, the present invention idso relates to a . 
recombinant DNA molecule or cloned gene, or a degenerate variant thereof, which 
encodes any one or all of the subunits of the DNA Polymerase Illrtype enzymes of the 
present invention, or active fragments thereof In the instance of the t subunit, a 
predicted molecular weight of about 58 kD and an amino acid sequence set forth in 
SEQ ID Nos. 4 or 5 is comprehended; preferably a nucleic acid molecule, in particular 
a recombinant DNA molecule or cloned gene, encoding the 5 8 kD subunit of the 
Polymerase III of the invention, that has a nucleotide sequence or is complementary to 
a DNA sequence shown in FIGURES 4A and 4B (SEQ ID No. 1), and the coding 
region for^noA^set forth in FIGURE 4C (SEQ ID No. 3). The y subunit is smaller, 
and is approximately 50 kD, depending upon the extent of the firameshift that occurs. 
More particularly, and as set forth m FIGURE 4E (SEQ ID No. 4), the y subunit 
defined by a -1 firameshift possesses a molecular weight of 50.8 kD, while the y 
subunit defined by a -2 frameshift, set forth in FIGURE 4F (SEQ ID No. 5), possesses 
a molecular weight of 49.8 kD. 

As discussed above, the invention also extends to the gene;s including 
holA, holB, dnaX, dnaQ, dnaE, and dnaN from thermophilic eubacteria (i.e„ TJh. and 
A.ae.) that have been isolated and/or purified, to corresponding vectors for the genes, 
and particularly, to the vectors disclosed herein, and to host cells including such 
vectors. In this connection, probes have been prepared which hybridize to the DNA 
polymerase Ill-type enzymes of the present invention, and v^hich are selected firom 
the various oligonucleotide probes or primers set forth in the present application. 
These include, without limitation, the oligonucleotide defined in SEQ ID No. 6 the 
oligonucleotide defined in SEQ ID No. 8 the oligonucleotide defined in SEQ ID No. 
10 the oligonucleotide defined in SEQ ID No. 1 1 the oligonucleotide defined in SEQ 
ID No. 12 the oligonucleotide defined in SEQ ID No. 1 3 the oligonucleotide defined 



in SEQ ID No. 14 the oligonucleotide defined in SEQ ID No. 15, and the , 

oligonucleotide defined in SEQ ID No. 1 6. 

The mefliods of the invention include a method for producing a 

recombinant thermostable DNA polymerase IH-type enzyme firom a thermophUic 
bacterium, such as T.th.,AMe., Th.ma., or B.st. which comprises culturing a host cell 
transformed with a vector of the invention und^ conditions suitable for tiie expression 
of the present DNA polymerase HI. Another method includes a method for isolating a 
target DNA fragment consisting essentially of a DNA coding for a thermostable DNA 
polymerase Ill-type en2yme from a thermophilic bacterium comprising the steps of: 

(a) forming a genomic library from the bacterium; 

(b) transforming or transfecting an appropriate host cell with the 

library of step (a); 

(c) contacting DNA from the transformed or transfected host cell with 
a DNA probe which hybridizes to a DNA fragment selected from the group consisting 
of the DNA fragments defined in SEQ ID No. 6 and the DNA fragments defined in 
SEQ ID No. « or the oligonucleotides set forth above; wherein hybridizatioii is 
conducted under the following conditions: 

i) hybridization: 1% crystalline BSA (fraction V) (Sigma)i 

1 mM EDTA. 0.5 M NaHP04 (pH 712), 7% SDS at eS'^C for 12 hours and; 

ii) wash: 5 X 20 minutes vvith wash buffer consisting of 
0.5% BSA, fiction V), ImM Na2EDTA, 40 mM NaHP04 (pH 7.2), and 5% SDS; 

(d) assaying the transformed or transfected cell of step (c) which 
hybridizes to the DNA probe for DNA polymerase IH-type activity; and 

(e) isolating a target DNA fi^gment which codes for the thermostable . 

25 DNA polymerase Ill-type enzyme. 

Also, antibodies including both polyclonal and monoclonal antibodies, 

and the DNA Polymerase Ill-like enzyme complex and/or their y and x subunits, a 
subunit(s), 5 subunit; 5' subunit, p subunit, s subunit may be used in the preparation 
of the enzymes of the present invention as well as other enzymes of similair 
30 thermophilic origin. For example, the DNA Polyinerase Ill-type complex or its 
subunits may be used to produce both polyclonal and monoclonal antibodies to 
themselves in a variety of cellular media, by known techniques such as the hybridoma 
technique utilizing, for example, fused mouse spleen lymphocytes and myeloma cells. 
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The general methodology for making monoclonal antibodies by 
hybridomas is well known. Immortal, antibody-producing cell lines can also be 
created by techniques other than fusion, such as direct transformation of B 
lymphocytes with oncogenic DNA, or transfection with Epstein-Barr virus. See, e.g., 
Schreier et al., 1980; Hammerling et al., 1981; Kennett et al., 1980; see alsoU.S. 
Patent No. 4,341^761 to Ganfield et al.; U.S. Patent No. 4,399,121 to Albarella et al.; 
U.S. Patent No. 4,427,783 to Newman et al.; U.S. Patent No: 4,444,887 to Hoffinai^ 
U.S. Patent No. 4,451,570 to Royston et al.;U.S. Patent No. 4,466,917 to 
Nussenzweig et al.; U.S. Patent No. 4,472,500 to Milstein et al.; U.S. Patent No. 
4,491 ,632 to Wands et al.; and U.S. Patent No. 4,493,890 to Morris. 

Methods for producing polyclonal anti-polypeptide antibodies are 
well-known in the art. See U.S. Patent No. 4,493,795 to Nestor et al. A monoclonal 
antibody, typically containing Fab and/or F(ab')2 portions of useful antibody 
molecules, can be prepared using the hybridoma technology described in Antibodies - 
A Laboratory Manual, Uai\aw and Lane, eds.. Cold Spring Harbor Laboratory, New 
York (1988), vsiiich is incorporated herein by reference. Briefly, to form the 
hybridoma from which the monoclonal antibody composition is produced, a myeloma 
or other self-perpetuating cell line is fused with lymphocytes obtained from the spleen 
of a maimnal hyperimmimized with an elastin-binding portion thereof. 

A monoclonal antibody useful in practicing the present invention can 
be produced by initiating a monoclonal hybridoma culture comprising a nutrient 
medium containing a hybridoma that secretes antibody molecules of the appropriate 
antigen specificity. The culture is maiiitained under conditions and for a time period 
sufficient for the hybridoma to secrete the antibody molecules into the medium. The 
25 antibody-containing medium is then collected. The antibody molecules can then be 
further isolated by well-known techniques. 

Media useful for the preparation of these compositions are both well- 

knov.!! in the art and commercially available and include synthetic culture media, 
mbred mice and the like. An exemplary synthetic medium is Dulbecco's minimal 
30 essential medium (DMEM) (Dulbecco et al., 1 959) supplemented with 4.5 gm/1 

glucose, 20 mm glutamine, and 20% fetal calf serum. An exemplary inbred mouse 

strain is the Balb/c. 

Another feature of this invention is the expression of the DNA 
sequences disclosed herein. As is well known in the art, DNA sequences may be 



expressed by operatively linking them to an expression control sequence in an 
appropriate expression vector and employing that expression vector to transform an 
appropriate unicellular host. 

Such operative linking of a DN A sequence of this invention to an 

expressioSn control sequence, of course, includes, if not already part of the DNA 
sequence, the provision of an initiation codon, ATG, in the correct reacUng fr^ 

upstream of the DNA sequence. 

A \\dde varieity of host/expression vector combinations may be 
employed in expressing the DNA sequences of this invention. Useful expression 
vectors, for example, may consist of segments of chromosomal, non-chromosomdl 
and synthetic DNA sequences. Suitable vectors include derivatives of SV40 and 
known bacterial plasmids, e.g., E, coli plasmids col El, pCRl , pBR322, pMB9 and 
their derivatives, plasmids such as RP4; phage DNAS, e.g., the numerous derivatives 
of phage X, e.g., NM989, and other phage DNA, e.g.. Ml 3 and filamentous single 
stranded phage DNA; yeast plasmids such as the 2\x. plasmid or derivatives thereof; 
vectors useful hi eukaryotic cells, such as vectors useful in insect or mammalian cells; 
vectors derived froni combinations of plasmids and phage DNAs, such as plasniids 
that have been modified to employ phage DNA or other expression control sequences; 
and the like. 

Any of a wide variety of expression control sequences — sequences 
that control the expression of a DNA sequence bperatively linked to it - may be used 
in these vectors to express the DNA sequences of this invention. Such useful 
expression control sequences include, for example, the early or late prompters of 
SV40, CMV, vaccinia, polyoma or adenovirus, the lac system, the trp system, the 
TAC system, the TRC system, the LTR system, the major operator and promoter 
regions of phage ^, the control regions of fd coat protein, the promoter for 
3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid 
phosphatase (e.g., Pho5), the promoters of the yeast a-mating factors, and other 
sequences known to control the expression of genes of prokaryotic or eukaryotic cells 
or their viruses, and various combinations thereof. 

A wide variety of unicellular host cells are also useful in expressing the 
DNA sequences of this invention. These hosts may include well known eukaryotic 
arid prokaryotic hosts, such as strains of E. coli, Pseudomonas, Bacillus , 
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Streptomyces, ftingi such as yeasts, and animal cells, such as CHO, Rl.l, B-W and 
L-M cells, African Green Monkey kidney cells (e.g., COS 1, COS 7, BSCl, BSC40, 
and EMTIO), insect cells (e.g., Sf9), and human cells and plant cells in tissue culture. 

It ynW be understood that not all vectors, expression control sequences 
and hosts will function equally well to express the DNA sequences of this invention. 
NeithCT will all hosts function equally well with the same expression system. 
However, one skilled in the art will be able to select the propCT vectors, expression 
control sequences, and hosts without undu^ experimentation to accomplish the desired, 
expression without departing from the scope of this invention. For example, in 
selecting a vector, the, host must be considered because the vector must function in it. 
The vector's copy number, the ability to control that copy number, and the expression 
of any other proteins encoded by the vector, such as antibiotic markers, wiU also be 
considered. 

In selecting an expression control sequence, a variety of factors will 
normally be considered. These include, for example, the relative strength of the 
system, its controllability, and its compatibility with the particular DNA sequence or 
gene to be expressed, particularly with regard to potential secondary structures. 
Suitable unicellular hosts will be selected by consideration of, e.g., their compatibility 
with the chosen vector, their secretion characteristics, their ability to fold proteins 
correctly, and their fermentation requirements, as well as the toxicity to the host of the 
product encoded by the DNA sequences to be expressed, and the ease of purification 
of the expression products. 

Considering these and other factors a person skilled in the art will be 
able to construct a variety of vector/expression control sequence/host combinations 
25 that will express the DNA sequences of this invention on fermentation or ui large 
scale animal culture. 

It is further intended that analogs may be prepared from nucleotide 
sequences of the protein complex/subunit derived within the scope of the present 
invention. Analogs, such as fragments, may be produced, for example, by pepsin 
30 digestion of bacterial material. Other analogs, such as muteins, can be produced by 
standard site-directed inutagenesis of dnaX, dnaE, dnaQ, dnoN, holA, or holB coding 
sequences. Especially useful may be a mutation in dnaE that provides the polymerase 
with the ability to incorporate all four ddNTPs with equal efficiency thereby 
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producing an even binding pattern in sequencing gels, as discussed above and with 
reference to Tabor et al:, 1995. 

As mentioned above, a DNA sequence corresponding to dnaXidngQ^ 
hold, holBj dnaEy ov dnaNy or encoding the subunits of the DNA Polymerase in of the 
5 inventipn can be prepared synthetically rather than cloned. The DNA sequence can 
be designed with the aprpropriate codons for the amino acid sequence of the subunit(s) 
of interest. In general, one will select preferred codons for the intended host if the 
sequence will be used for expression. The complete sequence is assembled from 
overlapping oligonucleotides prepared by standard methods and assembled into a 
10 complete coding sequence (Edge, 1981; Nambair et al., 1984; Jay et al., 1984). / 
Synthetic DNA sequences allow convenient construction of genes 
which will express DNA Polymerase III analogs or "muteins". Altematiyely, DNA 
encoding muteins can be made by site-directed mutagenesis of native dnaX^ druzQy 
holA, holBy dnaE or dnoN genes or their corresponding cDN As, and muteins can be 
15 made directly using conventidnal polypeptide synthesis, 

A general method for site-specific incorporation of \mnatural amino 
acids into proteins is described in Noren et al., 1989. This method may be used to 
create analogs with unnatural amino acids. 

20 GENERAL DESCRIPTION OF THE INVENTION 

As discussed above, the present invention has as one of its 
characterizing features, that a Polymerase Ill-type enzyme as defined hereinabove, 
has been discovered in a thermophile, that has the structure and fimction of a 

25 chromosomal replicase. This structure and function confers significant benefit when 
the enzyme is employed in procedures such as PGR where speed and accuracy of 
DNA reconstruction is crucial. 

Chromosomal replicases are composed of several subunits in all 
organisms (Romberg and Baker, 1992). In keeping with the need to replicate long 

30 chromosomes, replicases are rapid and highly processive multiprotein machines. All 
cellular replicases examined to date derive their processi^dty fi-om one subunit that is 
shaped like a ring and completely encircles DNA (Kuriyan and O'Donnell, 1993; 
Kelman and O'Donnell, 1994). This "sliding clamp" subimit acts as a mobile tether 
for the polymerase machine (Stukenberg et al., 1991). The sliding clamp does not. 
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assemble onto the DNA by itself, but requires a complex of several proteins, called a 
'^clamp loader^' which couples ATP hydrolysis to the assembly of sliding clamps onto 
DNA (O'Donnell et al., 1992). Hence, Pol Ill-type cellular replicases are comprised 
of three Components: a clamp, a clamp loader, and the DNA polymerase. 
5 An overall goal is to identify and isolate all of the genes encoding the 

replicase subimits from a thermophile for expression and pimfication in large 
quantity. Following this, the replication apparatus can be reassembled from 
individual subunit components for use in kits, PGR, sequencing and diagnostic 
applications (Oimist et al., 1 995). 

10 As a beginning to identify and characterize.the replicase of a 

thermophile, we started by looking for a homologue to the prokaryotic dndX^ene 
which encode subunits (y and x) of the replicase. The dnaXgene has another 
homologue, holB, which encodes yet another subunit (5') of the replicase. The anmio 
acid sequence of 8' (encoded by /ioi/<) and x/y subunits (encoded by rfm^ 

1 5 particularly highly conserved in evolution from prokaryotes to eukaryotes (Chen et 
al., 1992; O'DonhfeU et al., 1993; Onrust et al,, 1993; Carter et al., 1993; Culhnan et 
al., 1995): 

One organism chosen for study and exposition herein is the exemplary 
extreme thermophile Thermus thermophilus {TJh,). It is understood that other 

20 members of the class such as the eubacterium Thermatoga are expected to be 

analogoiis in both structure and function. Thus, the investigation of T.th, proceeded 
and initially, a TJh. homologue of dnaXv/as identified. The gene encodes a full 
length protein of 529 amino acids. The amino terminal third of the sequence shares 
over 50% homology to dnaX genes as divergent as E. coli (gram negative) and A 

25 subtilis (gram positive). The T,th, dnaX gene contains a DNA sequence that provides 
a transiational frameshift signal for production of two proteins from the same geiie. 
Such fraiheshifting has been dociraiented only in the case ofE, coli (Tsuchihashi and 
Romberg, 1990; Flower and McHenry, 1990; Blinkowa and Walker, 1990). No 
frameshifting has been documented to occur in the dncOChomoXoga^s (RFC subunit 

30 genes) of yeast arid humans (Eukaryotic kingdom). 

The presence of a dnaX gene that produces two subunits implies that 
T,th. has a clamp loader (y) and may be organized by x into a PolIII*-type replicase 
like the replicative DNA polymerase of Escherichia coli, DNA polymerase III 
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holoenzyme. The E. coli DNA pblymerase III holoenzyme contains 10 dijfferent 
subunits, some in copies of two or more for a total composition of 1 8 polypeptide 
chains (Romberg and Baker, 1992; Onrust et al., 1995). The holoenzyme is 
composed of three major activities: the S-subunit DNA polymerase core (aiBO), the P 
5 subunit DNA sliding clamp, and the Srsubimit y complex clamp loader (y55'xv)- This 
3 component strategy generalizes to eukaryotes which utilize a clamp (PCNA) and a 
5-subunit RFC clamp loader (RFC) which provide processivity to DNA polymerase 6 
(reviewed in Kelman and O'Donnell, 1994). 

In E coli, the polymerase and clamp loader components are organized 

10 into one PoUII* particle by the x subunit, that acts as a "glue" protein (Onrust et al , 
1995). One dimer of x holds together two core polymerases in the particle which are 
utilized for the coordinated and simultaneous replication of both strands of duplex 
DNA (McHenry, 1982; Maki et al., 1988; Yuzhakov et al., 1996). The "glue" protein 
X subunit also binds one clamp loader (called y complex) thereby acting as a scaffold 

15 for a large superstructure assembly called DNA polynierase III*. The gene encoding 
X, called dnaX, also encodes the y subimit of DNA polymerase III. The P subunit then 
associates with Pol III* to form the DNA polymerase III holoenzyme. The y subunit 
is approximately 2/3 the length of x. y shares the N-terminus of x, but is truncated by 
a translational fiameshifting mechanism that, after the shift, encounters a stop codon 

20 within two amino acids (Tsuchihashi aiid Romberg, 1 990; Flower and McHenry, 

1 990; Blinkowa and Walker, 1 990). Hence, y is the N-te:rminal 453 amino acids of x, 
but contains one unique residue at the C-terminus (the penultimate codon encodes a 
Lys residue which is the same sequence as if the frameshift did not take place). This 
frameshift is highly efficient and occurs approximately 50% of the time. 

25 The sequence ofthe y and X subunits encoded by the c/woA:' gene are 

homologous to the clamp loading subimits in all other organisms extending from 
gram negative bacteria through gram positive bacteria, the Archeae Kingdom and the 
Eukaryotic Kingdom from yeast to humans (O'Donnell et al., 1993). AH of these 
organisms utilize a three component replicase (DNA polymerase, clamp and clamp 

30 loader) and in these cases the 3 components appear to behave as independent units in 
solution rather than forming a large holoenzyme superstructure. For example, in 
eukaiyotes from yeast to humans, the clamp loader is the five subunit RFC, the clamp 



is PCN A, and the polymerases 5 and s are all stimulated by the PCNA clamp 
assembled onto pruned DNA by RFC (reviewed in Kelman and O'Donnell 1994). 

The discovery of a dnaX gene in TJh. provided confidence ihat 
thermophilic bacteria would contain a three component Pol Ill-type enzyme. Hence, 
we proceeded to identify thc dnaQ and dnaN genes encoding, respectively, the 
proofreading 3-5" exonuclease, and the B DNA sliding clamp subunits of a Pol 
Ill-type enzyme. Following this, we purified fi-om extracts of T.th cells, a Pol Ill- 
type enzyme. This enzyme preparation had the unique property of extending a single 
primer aro\md a long 7.2 kb single strand DNA genome of M13mpl8 bacteriophage. 
Such a primer extension assay serves as a tool to detect and identify the Pol Ill-type of 
enzyme in cell extracts. The enzyme was confirmed to be a Pol Ill-tj^ enzyme 
based on its reactivity with antibody directed against the E. coli a subxmit (the? DNA 
polymerase subuiiit) and antibody directed against E. coli y subimit, Proteins 
corresponding to a, x, y, 6 and S' were easily visible and lend themselves to 
identification of the genes through use of peptide microsequencing followed by 
primer design for PGR amplification. For example, fix>m this DNA pol Ill-type 
preparation, the peptide sequence of the d subxihit was obtained, which then allowed 
the dnaE gene encoding the a subimit (DNA polymerase) of the Pol Ill-type enzyme 
to be obtain. 

These methods should be widely applicable to other thermophilic 
bacteria. Additional antibody reagents against other Pol Ill-type enzyme components, 
such as RFC subunits, DNA polymerase delta, epsilon or beta, and the PCNA clamp 
from, known organisms can be made quite easily as polyclonal or monoclonal 
antibody preparations using as antigen either naturally purified sequence, recombinant 
sequence, or synthetic peptide sequence. Examples of known sequences of these Pol 
Ill-type enzymes are to be found in: DNA polymerases (Braithwaite and Ito, 1993), 
RFC clamp loaders (Cullman et al., 1995) and PCNA (Kelman and O'Donnell, 1995). 

The remaining genes of T.th. Pol III needed for efficient extelnsion of 
primed templates, holA and holB, are now identified. The holA coding sequence 
(SEQ. ID. No. 157) encodes the 5 subunit (SEQ. ID. No. 158) and the holB coding 
sequence (SEQ. ID. No. 155) encodes the 5' subunit (SEQ. ID. No. 156). The holA 
and holB coding sequences and the 5 and 5' subunits were identified via BLAST 
search (Altschul et al., 1997), and subsequently isolated following circular PGR. 



These genes v^dll provide the subunit preparations through 

recombinant techniques and protein purification protocols. The protein subunits can 
then be used to reconstitute the enzyme complexes as they exist in the celL TTiis type 
of reconstitution of Pol III has been demonstrated using the protein subunits of DNA 
polymerase III holoenzyme from E. coli to assemble the entire particle. See, e.g., 
U.S. Patent Nos. 5,583,026 and 5,668,004 to O'Dohnell; and Onrust et al., 1995. The 
disclosxires of these references are incorporated herein in their entireties. 

Another organism chosen for study and exposition herein is tl^^ 
extreme theraiophile Aquifex aeolicus. Thus, the present invention alsb relates to 
various isolated DNA molecules from Aquifex aeolicus^ in particular the DNA 
molecules encoding various replication proteins. These include d>ia£, dnaX^ dnaN^ 
hoUy holB.ssbim A moX^cyj^cs from A. aeolicm. These DNA molecules can be 
inserted into an expression system or used to transform host cells from which isolated 
proteins can be obtained. The isolated proteins encoded by these DNA molecules are 
also disclosed. 

Unless othenvise indicated below, the Aquifex aeolicus sequences were 
obtained by sequence comparisons using the Thermus thermophilus counterparts, as 
query against the genome of Aquifex aeolicus (Deckert et al., 1 998). 

The A. aeolicus dnaE gfinc has a nucleotide coding sequence according 
to SEQ. ID. No. 1 17 and encodes the a subunit of the of DNA Polymerase III, which 
has an amino acid sequence according to SEQ. ID. No. 118. The Amc. a subunit has 
approxunately 41% aa identity to the T.r/i. a subunit. 

The A. aeolicus dnaX geine has a nucleotide coding sequence according 
to SEQ. ID. No. 1 1 9 and encodes the x subunit of the of DNA Polymerase III, which 
has an amino acid sequence according to SEQ. ID. No. 1 20. The A.ae. x subunit has 
approximately 51% aa identity to the r.r/i. X subunit. 

The A. aeolicus dnaN gene has a nucleotide coding sequence according 
to SEQ. ID. No. 121 and encodes the p subunit of DNA Polymerase III, which has an 
amino acid sequence according to SEQ. ID. No. 122. The A.ae: p subunit has 
approximately 27% aa identity to the T.//?. p subunit. 

The A. aeolicus dnaQ gene has a nucleotide coding sequence 
according to SEQ. ID. No. 127 and encodes the e subunit of the of DNA Polymerase 



Ill, which has an amino acid sequence according to SEQ. ID. No. 128. The^.ae. e 
subunit has approximately 26% aa identity to the T.tk s subunit. 

The A. aeolicus ssb gene has a nucleotide coding sequence according 
to SEQ. ID. No. 129 and encodes the SSB protein, which has an amino acid sequence 
according to SEQ- ID. No. 130. The A.de SSB protem has approximately 22% aa 
identity to Ae r./A. SSB protein. 

Further, the coding sequences of A. aeolicus genes encoding the 
helicase (dnoB), helicase loader (dnaQ, and primase (dnaG) are also disclosed. The 
A. aeolicus dnaB gene has a nucleotide coding sequence according to SEQ. ID, No. 
131 and encodes the £)naB protein, which functions as a helicase and has an amino 
acid sequence according to SiEQ. ID. No. 132. The A, aeolicus dnaG gene has a 
nucleotide coding sequence according to SEQ. ID. No. 133 and encodes the DiiaG 
protein, which functions as a primase and has an amino acid sequence according to 
SEQ ID. No; 134. The A, aeolicus dnaC gene has a nucleotide coding sequence 
according to SEQ. ID. No. 135 and encodes the DnaC protein, which functions as a 
helicase loader aiid has an amino acid sequence according to SEQ. ID. No. 136. 

The A. aeolicus holA and holB genes were previously unidentified by 
Deckert et al., 1998. Using Thermus thermqphilus 8* subunit amino acid sequence 
and the Thermatoga maritima 6 subunit amino acid sequence (SEQ. ID. No. 146 
\\4iich itself was obtained using the T.tk 8 subunit amino acid sequence of SEQ. ID. 
No. 1 58) in separate BLAST searches (Altschul et al., 1 997), corresponding 
polypeptide products in Aquifex aeolicus were identified. lh& A. aeolicus hplA gene 
has a nucleotide coding sequence according to SEQ. ID. No. 123 and encodes the 8 
subxmit of the of DNA Polymerase III, which has an amino acid sequence according 
to SEQ: ID. No. 124. the A.ae. 5 subunit has approximately 21% aa identity to the 
r.m. 8 subunit. The A. aeolicus holB gene has a nucleotide coding sequence 
according to SEQ. ID. No. 125 and encodes the 8* subunit of the of DNA Polymerase 
Hi, which has an amino acid sequence according to SEQ. ID. No. 126. ThcA,ae. S' 
subunit has approximately 24% aa identity to the TJh 8' subxmit. 

This invention also clones at least the coding regions of a set of ^. 
aeolicus genes which encode proteins that assemble into an A. aeolicus DNA 
polymerase III replication enzyme. These genes {dnaE, dnaN, dnaX, dnaQ, holA, 
holBy ssb) were cloned into expression vectors, the proteins were expressed in E. coli. 
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and the corresponding protein subunits were piirified (alpha, beta, tau, delta, delta ^ 
prime, SSB). This invention identifies the major protein-protein contacts among these 
subtmits, shovsrs how these proteins can be assembled into higher order multiprotein 
complexes, and how to form a rapid and processive DNA polymerase III holoenzyme. 
5 In contrast to the E. coli and T. thermophilus dndX genes which encode 

both T and y subunits, the A. aeolicus dnaX gene produces only the fiill length t 
subunit when expressed in E. colL The A. aeolicus x is intermediate in length 
between the y and x subunits ofE. coli DNA polymerase in holoenzymje. The E. coli 
T binds the y subunit does not bind a. Due to the intermediate size of A. aeolicus x, 
10 it was not known whether the A. aeolicus x would bind the a subunit. This invention 
shows that indeed, the .4. aeolicus x binds to a, as well as 5 and 5', thereby forming an 
A, aeolicus ax88' complex. Until the identification of the 8 and 8' subunits by the 
present invention, their existence, let alone their interaction with x and a, was not 
even known. 

15 The A. aeolicus ax88Vp Pol III can be applied in several \iseful DNA 

handling techniqueis. For example;, the thenhophilic Pol III will be usefid in DNA 
sequencing, especially at high temperature. Also, use of a thermal resistant rapid and 
processive Pol III is an important improvement to polymerase chain reaction 
tiechnology. The ability of the A. aeolicus Pol III to extend primers for multiple 

20 kilobases makes possible the amplification of very long segments of DNA (long chain 
PGR). 

Another organism chosen for study and exposition herein is the 
extreme thermophile Thermotoga maritima. Thus, the present invention also relates 
to various isolated DNA molecules fi-om Thermotoga maritima^ in particular the DNA 

25 molecules encoding various replication proteins. These include dnaEy dnaX, dnaN, 
dnaQy holA^ holBy ssb DNA molecules fi"om Thermotoga maritima. These DNA 
molecules can be inserted into an expression system or used to transform host cells 
from which isolated proteins can be obtained. The isolated proteins encoded by these 
DNA molecules are also disclosed. 

30 Unless otherwisie indicated below, the Thermotoga maritima sequences 

were obtained by sequence comparisons using the Thermus thermophilus counterparts 
as query against the genome of Thermotoga maritima (Nelson et al., 1999). 
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The r. maritima dnaE gene has a nucleotide coding sequence 
according to SEQ. ID. No. 137 and encodes the a subunit of the of DNA Polymerase 
in, which has an amino acid sequence according to SEQ. ID. No. 138. The Tim. a 
subunit has approximately 33% aa identity to the T.th a subvinit. 
5 The r. maritima dnaQ gene has a nucleotide coding sequence 

according to SEQ. ID. No. 1 39 and encodes the e subunit of the of DNA Polymerase 
III, which has an amino acid sequence according to SEQ. ID, No. 140. ITieT.m. e 
subunit has approximately 34% aa identity to the T.th. e subunit. 

Th& T. maritima ^/noA^ gene has a nucleotide coding sequence 
10 according to SEQ. ID^No. 141 and encodes the x subunit of the of DNA Polymerase 
in, which has an amino acid sequence according to SEQ. ID. No. 142. The T.m. x 
subunit has approximately 48% aa identity to thte t.th. x subunit. 

The T. maritima dnaN%enp hais a nucleotide coding sequence 
according to SEQ. ID. No. 143 and encodes the P subunit of DNA Polymerase III, 
15 which has an amino acid sequence according to SEQ. ID. No. 144, The T.m. p 
subunit has approxiimately 28% aa identity to the T.th. P subunit. 

The T maritima ssb gene has a nucleotide coding sequence according 
to SEQ. ID, No. 149 and encodes the SSB protein, which has an amino acid sequence 
according to SEQ. ID. No. 150. The T./n. SSB protein has approximately 18% aa 
20 identity to the T.th. SSB protein. 

Further, the coding sequences of T maritima gen^s encoding the 
helicase {dnaB) and primase (dnaG) are also disclosed. The T. maritima dndB gene 
has a nucleotide coding sequence according to SEQ. ID. No. 151 and encodes the 
DnaB protein, which functions as a helicase and has an amino acid sequence 
25 according to SEQ. ID. No. 152. The T. maritima dnaG gene has a nucleotide coding 
sequence according to SEQ. ID. No. 153 and encodes the DnaG protein, which 
functions as a primase and has an amino acid sequence according to SEQ. ID. No. 
154. 

The T maritima holA andholB genes were previously imidentified by 
30 Nelson et al., 1999). Using the Thermus thermophilus 5 and 8' subunit amino acid 
sequences (SEQ. ID. Nos. 158 and 156, respectively) in separate BLAST searches 
(Altschul et al., 1997), corresponding polypeptide products in T. maritima were 
identified. The T maritima holA gene has a nucleotide coding sequence according to 



SEQ. ID. No. 145 and encodes the 6 sub\init of the of DNA Polymerase III, which has 
an amino acid sequence according to SEQ. ID. No. 146. The T.m. 5 subunit has 
approximately 37% aa identity to the TJh. 5 subunit. The T.m. holB gene has a 
nucleotide coding sequence according to SEQ. ID. No. 147 and encodes the 5' subunit 
5 which has an amino acid sequence according to SEQ. ID. No. 148. The T.m, 5- 
subvinit has approximately 25% aa identity to the r.r/i. 8' subunit. 

Yet another organism chosen for study and exposition herein is the 
extreme thermophile Bacillus stearothermophilus. Thus, the present invention also 
relates to varioxis isolated DNA molecules from jBac///tt5 5/earo^/ier7w^ 

10 particular the DNA molecules encoding various replication proteins. These include 
dfmE^ dnaXy dnoNy dnaQy holAy holB^ ssb DNA molecules from Bacillus 
stearothermophilus. These DNA molecules can be inserted into an e>q>ression system 
or used to transform host cells from which isolated proteins can be obtainied. The 
isolated proteins encodied by these DNA molecules are also disclosed. 

15 Unless otherwise indicated below, the Bacillus stearothermophilus 

sequences were obtained by searching the database of this organism (at 
http://www.genome.ou.edu)i 

The B. stearothermophilus polC gene has a nucleotide coding sequence 
accordingto SEQ. ID. No. 1 83 and encodes the PolC or a-large subunit of Ae DNA 

20 Polymerase III, which has an amino acid sequence according to SEQ. ID. No. 1 84. 
The B,st. PolC subunit, like the PolC subimits of other Gram positive organisms, 
contains both polymerase and 3-5* exonuclease activity. This subunit, therefore, is 
essentially a fusion of a and s. 

The B stearothermophilus dnaX gene has a partial nucleotide coding 

25 sequence according to SEQ. ID. No. 181 and encodes the x subunit of the of DNA . 
Polymisrase ni, which has a partial amino acid sequence according to SEQ. ID. 
No. 182. The B,st. x subimit has approximately 31% aa identity to the Tjh. x subunit. 

The A stearothermophilus dnaN gene has a partial nucleotide coding 
sequence according to SEQ. ID. No. 173 and encodes the subimit of DNA 

30 Polymerase III, which has a partial amino acid sequence iaccording to SEQ. ID. 

No. 174. The B.st.^ subunit has approximately 21% aa identity to ib& Tth. p subunit. 

The B, stearothermophilus ssb gene has a nucleotide coding sequence 
according to SEQ. ID. No. 175 and encodes the SSB protein, which has an amino acid 
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sequenceaccordingtoSEQ. lb. No. 176. The SSB protein has approximately 
23% aa identity to the TJh SSB protein. 

The B. stearothermophilus holA gene has a nucleotide coding sequence 
accordinjg to SEQ. ID- No. 177 and encodes the 5 subunit of DNA Polymerase III, 
5 which has 9n amino acid sequence according to SEQ. ID. No. 178. The B.st 5 
subiimt has approximately 26% aa i^^ 

. The B: stearothermophilus holB gene has a nucleotide coding sequence 
according to SEQ. ID. No. 179 and encodes the 6' subunit of DNA Polymerase III, 
which has an amino acid sequence according to SEQ. ID. No. 1 80. The B,st. 5' 

10 subimit has approxiniately 25% aa identity to the TJh, 5' subunit. 

By conducting BLAST searches of unidentified genomic DNA fi-om 
other thermophilic eubacteria, it is possible to identify coding regions which encode 
various functional subunits of other Pol III replicative machinery. 

Although it is generally appreciated that proteins isolated from a 

IS thermophile should retain activity at high temperature, there is no guairantee that they 
wdll retain tenipdrature resistance when isolated in pure form. This invention shows 
that thev4. aeolicus Pol III, like the T. thermophilus Pol III, is resistant to high 
temperature. It is expected that the Th, maritima and B, stearothermophilus Pol III 
enzymes will similialry be resistant id high temperature. 

20 The following experiments illustrate the identification and 

charactdization of the enzynies and constructs of the present invention. Accordingly, 
in Examples 1-8 below^ the identification and expression of tiie y and x is presented, 
as tiie first step in the elucidation of the Thermus thermophilus Polymerase III 
reflective of the present invention. Examples 9-12 which follow set forth the protocol 

25 for the purification of the remainder of the sub-units of the enzyme that represent 
substantial entirety of the functional replicative machinery of the enzyme. 
Examples 18-30 demonstrate the preparation of isolated A, aeolicus sequences Pol III 
subunits and their thermostable use. 



EXAMPLE 1 



EXPERIMENTAL PROCEDURES 

Materials 

DNA modification enzymes were fix)m New England Bw^ 
Labelled nucleotides were from Amersham, and unlabeled nucleotides were from 
New England Biblabs The Alter-1 vector was from Promega. pET plasmids and £. 
co// strains, BL21(DE3) and BL21(DE3)pLysS were fixjmNovagen: 
Oligonucleotides were from Operon, Buffer A is 20mM Tris-HCl (pH 7.5), O.lmM 
EDTA, SmMDTT, and 10% glycerol. 

Genomic DNA 

Thermus thermophilus (strain HB8) was obtained from the American 
Type Tissue Collection. Genomic DNA was prepared from cells groyvn in 0.1 1 of 
Thermus medium N697 (ATCG: 4 g yeast extmct, 8.0 g polypeptone (BBL 1 1910), 
2.0 g NaCl, 30.0 g agar, 1.0 L distilled water) at VS^'C overnight. Cells were collected 
by centrifugation at 4**C and the cell pellet was resuspended in 25 nil of 100 mM 
Tris-HCl Q>H 8.0), 0.05 M EDTA, 2 mg/ml lyso^mie and incubated at room 
temperature for 1 0 min. Then 25 ml 6. 1 0 M EDTA (pH 8.0), 6% SDS was added and 
mixed fpllpwed by 60 ml of phenol. The mixture was shaken for 40 min. followed by 
centrifugation at 1 0,000 X G for 1 0 min. at room temperature. The upper phase (50 
ml) was removed and mixed with 50 nil of phenol xhloroform (50:50 v/v) for 30 min. 
followed by centrifugation for 10 min. at room temperature. The upper phase was 
decanteid and the DNA was precipitated upon addition of 1/1 0th volume 3 M sodium 
acetate (pH 6.5) and 1 volvime ethanol. The precipitate was collected by 
centrifugation and washed twice with 2 ml of 80% ethanol, dried and resuspended in 
1 ml T.E. buffer (lOmM Tris Hcl (pH 7.5), ImM EDTA). 

Cloning of dnaX . 

DNA oligonucleotides fpr amplification of T. th genomic DNA were as 
follows. The upstream 32mer (5'-CGCAAGCTTCACGCSTACCTSTTCTCCGGSAC 
-3*, S indicating a mixture of G and C) (SEQ. ID. No. 6) consists of a Hind III site 
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within the first 9 nucleotides (underlined) followed by codons (SEQ. ID. No. 29) 
encoding the following amino acid sequence (HAYLFSGT) (SEQ. ID. No. 7). The 
downstream 34 mer (S'-CGCGAATTCGTGCTCSGGSGGCTCCTCSA 
(SEQ. ID. No. 8) consists of an EcoRI site (underlined) followed by codons (SEQ. ID. 
5 No. 30) encoding the sequence KTLEEPPEH (SEQ. ID. No. 9) on the complementary 
strand. The amplification reactions contained 10 ng TJh, genomic DNA, 0.5 mM of 
. each primer, in a volxmie of 100 ^il of Vent polymerase reaction mixture according to 
the manufacturers instructions (1 0 ^il ThermoPol Buffer, 0.5 mM each dNTP and 0.5 
n:iMMgS04). Amplification was performed using the foUoAving cycling scheme: 5 
10 cycles of: 30 sec. at 95.5**G, 30 sec. at 40''C, 2 min. at Ti^'C; 5 cycles of: 30 sec. at 
95.5**C, 30 sec. at 45**C, md 2 min. at 72**C; and 30 cycles of: 30 sw^ 
sec. at SO^C, and 30 sec. at 72^C. Rpdiicts were visualized in a 1.5 % native agarose 
gel. 

Genomic DNA was digested with either Xhol, Xbal, StuI, PstI, Ncol, 
15 Mlul, Kpnl, Hindlll, EcoRI, EagI, Bgll, or BamHI, followed by Southern analysis in 
a native agarose gel (Maniatis et al., 1982). Approximately 0.5 \xg of digest was 
analyzed in each lane of a 0.8 % native agarose gel followed by transfer to an MSI 
filter (Micron Sep2iratidns Inc.). The transfer included the following steps: 
1 . The agarose gel was soaked in 500 ml of 1% HCl with gentle shaking for 10 min. 
20 2. Then the gel was soaked in 500 ml of 0.5 M NaOH + 1 .5 M NaCl for 40 min. 

3. After that the gel was soaked in 500 ml of IM ammonivmi acetate for 1 h. 

4. The DNA was transferred to the MSI filter with the use of blotting paper for 4 h. 

5. The filter Svas kept at 80**C for 15 liiin. in the oven. 

6. The pre-hybridizatioh step was run in 10 ml of Hybridization solution (1% 

25 crystalline BSA (flection V) (Sigma), 1 mM EDTA, 0.5 M NaHP04 (pH 7.2), 7% 
SDS) at 65«C for 30 min. 

7. The probe, radiolabelled by the random priming method (see below), was added to 
the pre-hybridization solution and kept at 65**C for 12 h. 

8. The filter was washed with low stringency with 200 ml of the wash buffer (0.5% 
30 BSA, fi-actionV), ImM Na2EDTA, 40 mM NaHP04 (pH 7.2), 5% SDS with genUe 

shaking for 20 min. This step was repeated 5 times, followed by exposure to X-ray 
film (XAR-5, Kodak). 

As a probe, the PGR product was radiolabelled by random as follows. 
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1. 14 ml of the mixture containing 0.2 of PGR product DNA, 1 ^ig of the pd(N6) 
(Promega) and 2.5 ml of the lOX Klenow reaction buffer (100 mM Tris-HCl (pH 7.5), 
50 mM MgCb, 75 mM dithiothreitol) were boiled for 10 min. and then kept at 4**C. 

2. The reaction volwne was increased up to 25 jil, containing in addition 33 jiM of 
5 each dNTP, except dATP, 10 nCi [a-^^P] dATP (800 Ci/mM), and 2 units of Klenow 

enzyme. The reaction mixture was incubated 1.5 h. 

3. 2 mg of sonicated herring sperm DNA (GibcoBRL) was added to the reaction and 
the volunie was increased to 2 ml using hybridization solution. The sample was then 
boiled for 10 min. 

10 A genomic library of Xbal digested DNA was prepared upon treating 1 

lig genomic T.th. DNA with 10 units of Xbal in 100 ^1 of NEBuffer N2 (50 mM 
NaCl, 10 mM Tris-HCl (pH 7.9), 10 mM MgC12, 1 mM DTT) for 2 h at 37^C. The 
digested DNA was purified by phenol chloroform extraction and ethanol 
precipitation. The Alter-1 vector (0.5 fig)(Promega) was digested with 1 unit of Xbal 

15 in NEBufifer N2 and then purified by phenol/chloroform extraction arid ethanol 
precipitation. One microgram of genomic digest was incubated with 0.05 |ig of 
digested Alter-1 and 20 U of T4 ligase in 30 ^^ of ligase buffer (50 mM Tris-HCl (pH 
7.8), 10 mM MgC12, 10 mM DTT and 1 mM ATP) at 15'C for 12 h. The ligation 
reaction was transformed into the DHSa istrain of E. coli and transfonhants were 

20 plated on LB plates containing ampicillin and screened for the ^noA!^ insert using the 
radiplabelled PCR probe as follows: 

1 . The colonies tested were lifted onto MSI filters, approximately 100 colonies to 
each filter. 

2. The filters, removed fi-om the LB/Tc plates, were placied side up on a sheet of 
25 Whatman 3 MM paper soaked with 0.5 M NaOH for 5 min. 

3. The filters wiere transferred to a sheet of paper soaked with 1 M Tris-HCl (pH 7.5) 
for 5 min. 

4. The filters were placed on a sheet of paper soaked in 0.5 M Tris-HCl (pH 7.5), 
1.25 M NaCl for 5 min. 

30 5. After drying by air, the filters were heated in the oven 80*^ C for 1 5 min. and then 
were analyzed by Southem hybridization. 

Plasmid DNA was prepared fi-om 20 positive colonies; of these 6 contained the 
expected 4 kb insert when digested with Xbal. Sequencing of the insert was 



perfonned by the Sanger method using the Vent polymerase sequencing kit according 
to the manufacturers instructions (New England Biolabs). 



Identification of the dnaXaene 

The d>iaA!^ genes of the gram negative E. coli and the gram positive B. 
subtilis share more than 50% identity in amino acid sequence within the N-terminal 
1 80 residues containing the ATP-binding domiain (Fig. 2). Two highly conserved 
regions (shown in bold in Fig. 2) were Used to desi^ oligonucleotide primers for 
application of the polymerase chain reaction to TJh, genonuc DNA. The expected 
PGR product, including the restriction sites (i.e. before cutting) is 345 nucleotides. 
Use of these primers with genomic TJh DNA resulted in a product of the expected 
size. The PGR product was then radiolabelled and used to probe genomic DNA in a 
Southern analysis (Fig. 3). Genomic DNA was digested with several different 
restriction endonucleases, electrophpresed in a native agarose gel and then probed 
with the PGR firagment. The Southern analysis showed an Xbal fragment of 
approximately 4 kb, more than sufficient length to encode the c/ajoAT gene. Other 
restriction nucleases produced firagments that were significantly longer, or produced 
two or more fi-agments indicating presence of a site within the coding sequence of 
dnaX. 

To obtain fiiU length dnaX^ genomic DNA was digested with Xbal and 
ligated into Xbal digested Alter-1 vector. Ligated DNA was transformed into DH5 
alpha cells, and colonies were screened with the labeled PGR probe. Plasmid DNA 
was prepared from 20 positive colonies and analyzed for the appropriate sized insert 
using Xbal. Six of the twenty clones contained the expected 4 kb Xbal Augment as 
an insert, the sequence of which is shown in Figs. 4 A and 4B. 

The frameshiff site 

The dnaX gene of E, coli produces two proteins, the y and t subunits, 
by a -1 frameshift (Tsuchihashi and Romberg, 1990; Flower and McHenry, 1990; 
Blinkowa and Walker, 1990), The fiill length product yields x, and the fi-ameshift 
results in addition of one amino acid before encountering a stop codon to produce y. 
The -1 frameshift site in the E. coli dnaX gene contains the sequence, A AAA AAG, 
which follows the X XXY YYZ rule found in retroviral genes (Jacks et al., 1 988). 
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This "slippery sequence" preserves the initial two residues of the tRNAs in the 
aminoacyl and peptidyl sites both before and after the frsuneshift. Mutagenesis of the 
E. coli dnaX firameshifting site has shown that the first three residues can be 
nucleotides other than A, but that A's in the second set of three nucleotides is 
5 important to franieshiiting (Tsuchihashi and Brown, 1 992). 

Immediately downstream of the stop codon is a potential stem-loop 
structure which enhances frameshifting, presvmiably by causing the ribosome to 
pause. Fiuther, the AAG cbdon lacks a cognate tRNA in E. coli and thixis the G 
residue may facilitate the pause, and has been shown to aid the vigorous frameshifting 

10 observed in the E. coli dnaX gene (Tsuchihashi and Brown, 1992). A fourth 
component of frameshifting in the coli dnaX gene is presence of an upstream 
Shine-Dalgarno sequence which is thought to pair with the 1 6S rRNA to increase the 
frequency of frameshifting still further (Larsen et al., 1994^^ 

Examination of the TJh dnaX sequence reveals a isingle site that 

1 5 fulfills the X XXY YYZ rule in which positions 4-7 are A residues. The site is unique 
from that in E, colz as all seven residues are A, and the heptanucleotide sequence is 
flanked by another A residue on each side (i.e. A9). Surprisingly, the stop codon 
immediately downstream of this site is in the -2 frame, although there is a stop codon 
in the -1 fr'ame 28 nucleotides downstream of the -2 stop codon. Indeed, a -2 

20 frameshift would fulfill the requirement that the first two nucleotides of each codon in 
the peptidyl and aminoacyl sites be conserved during either a -1 or a -2 firanieshift. 
As with the case ofE, coli dnaX, there are secondary structure step loop structures 
immediately downstream. Finally, there is a Shine-Dalgamo sequence inmiediately 
adjacent to the frameshift site, as well as another Shine-Dalgamo sequence 22 

25 nucleotides upstream pf the frameshift site. 

Assuming the first stop codon is utilized (i.e. -2 frameshift), the 
predicted size of the y subunit in TJk is 454 amino acids for a mass of 49.8 kDa, over 
2 kDa larger than the 43 1 residue y subunit (47.5 kDa) of E: coli. This would result in 
2 residues after the -2 frameshift (i.e. after the GluLysLys, the residues LysAla would 

30 be added) to be compared to the result of the -1 frameshift in E. coli which also results 
in 2 residues (LysGlu). In the event that a -1 frameshift were utilized in the r.r/z: 
dnaX gene, then an additional 12 residues would be added following the frameshift 
for a molecular mass of 50.8 kDa (i.e. after the GluLysLys,. the residues 



LysProAspProLysAlaProProGlyPrpThrSer would be added at aa 453-464 of SEQ. ID. 
No. 4). As explained later, this nucleotide sequence was found to promote botii -1 
and -2 fiBmeshifting in E, coli (Fig. 8). But first, we examined TJh cells by Westem 
analysis for the presence of two subunits homologous to E. coli y and x. 

EXAMPLE 2 

Frameshiftine analysis of the TJh. dnaXaene 

Frameshifiing was analyzed by inserting the firameshift site into lacZ in 
the three different reading fi^es, followed by plating on JT-ga/ and scoring for blue 
or white colony formation (Weiss et al., 1987). The fi^ameshifting region within T. /A 
dnaX\^BS subcloned into the EcoRI/BamHI sites of pUC19. These sites are within 
the polylinker inside of the B-galactosidase gene. Three constructs were produced 
such that the insert was either in frame with the downstream coding sequence of 
B-galactosidase, or were out of frame (either -1 or -2). An additional three constructs 
were designed by mutating the firameshift sequence and then placing this insert into 
the three reading frames ofthefi-galactosidase gene. Theise six plasmids were 
constructed as described below. 

The upstream primer for the shifty sequences was 5 -gcg egg ate egg 
agg g^g ^ gcc tea gcc ca-3' (SEQ. ID. No. 10). The BamHI site for cloning 

into pUC is xmderlined. Also, the stop codon, tga, has beien mutated to tea (also 
underlined). The upstream primer for the mutant shifty sequencei was: 5 -gcg egg ate 
egg agg gag aga aga aaa gee tea gcc ca-3' (SEQ. ID. No. 11). The mutant sequence 
contains two substitutions of a G for an A residue in the polyA stretch (underlined). 
Three downstream primers were utilized with each upstream primer to create two isets 
of three inserts in the 0 frame, -1 frame and r2 frame. The sequence of these primers, 
and the length of insert (after cutting with EcoRI and BanHI and inserting into 
pUC19) are as follows: 5*-gaa tta aat teg cgc ttc ggg agg tgg g-3' (0 frameshift, total 
58 nucleotide insert) (SEQ. ID. No. 12); 5'-gcg cga att cgc get teg gga ggt ggg-3' (-1 
frame, 54mer insert) (SEQ. ID. No. 13); and 5'-gcg cga att egg gcg ctt cag gag gtg 
gg-3' (-2 frame, 56mer insert) (SEQ. ID. No. 14). The downstream primers have an 
EcoRI site (underlined); the EcoRI site of the 0 frame insert was blunt ended to 
produce the greater length insert (converting the EcoRI site to an aattaatt sequence). 
Also, the teg sequence, which produces the tga stop codon (underlined) was mutated 
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to tea in the -2 downstream primer so that readthrough would 
frameshift occurred. 

In summary, a region surrounding the firameshifl site and ending at 
least 5 nucleotides past the -1 frameshift stop codon was inserted into the B- 
S galactosidase gene of pUC19 in the three different reading frames (stop codons were 
mutated to prevent stoppage following a fi^eshift). These three plasmids were 
introduced into E, coli and plated with X-gal. The results, in Fig. 8, show that blue 
colonies were observed after 24 h incubation with all three plasmids and therefore 
both -1 and -2 framesfaifting had occurred. 

10 To further these results, two 7 residues were introduced into the polyA 

tract which should disrupt the ability of this sequence to direct frameishifls. The 
mutated slippery sequence was inserted into pUC19 followed by transformation into 
E. coli and plating on X-gal. The results showed that both -1 and -2 frameshiftirig 
was prevented, ftuther supporting the fact that frameshifting requires the polyA tract 

15 as expected (Fig. 8). 

EXAMPLES 

Expression vector for TJh. y and x 

20 The dnaX gene was cloned into the pET16 expression vector in the 

steps ishown in Fig. 9. First, the bulk of the gene was cloned into pET16 by removing 
the Pmll/Xbal fragment from pAlterdnaX, and placing it into Smal/Xbal digested 
Fuel 9 to yield Fuel 9dnaXCterm. The N-terminal sequence of the dnaX gene was 
then reconstructed to position an Ndel site at the N-terminus. This was perfomied by 

25 amplifying the 5' region encoding the N-terminal section of y/x using an upstream 
primer containing an Ndel site that hybridizes to the dnaXg^ne at the initiating gtg 
codon (i.e. to encode Met where the Met is created by the PGR primer, and the Val is 
the initiating gtg start codon ofdrtaX). The primer sequence for this 5' end was: 
5'-gtpgt gcatatg gtg age gcc etc tac cgc c-3' (SEQ. ID. No, 15) (where the Ndel site is 

30 underlined, and the coding sequence of dnaX follows). The downstream primer 
hybridizes past the Pmll site at nucleotide positions 987 - 1004 downstream of the 
initiating gtg (primer sequence: 5'-gtggtggtcgac cca gga ggg cca cct cca g-3' (SEQ. 
ID. No. 16) where the initial 12 nucleotides contain a SalGI restriction site, followed 
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by the sequence from the region downstream the stop codon). The 1.1 kb nucleotide 
PGR product was digested with Pmll/Ndel and the Pmll/Ndel fragment was ligated 
into Ndel/PniU digested Pucl9dnaXCtenii to fonnPucl9dn^^ ThePucl9dnaX 
plasmid was then digested with Ndel and Sail and the 1.9 kb fragment containing the 
S ^naA!^gene was purified using the Sephaglas BandPrep^^ 

pETl 6b was digested with Ndel and XhoL Then the full length dnaX gene was 
ligated into the digested pETl 6b to form pET JnoAT. 



EXAMPLE 4 

10. ' . , ' 

Expression of Tjh. y and x 

As discussed in the previous example, the dnaX%en^ was engineered 
into the T7 based IPTG inducible pETl 6 vector such thiat the initiation codon was 
placed precisely following the Met residue N-terminal leader sequence (Fig. 9). This 

Is should produce a protein containing the entire sequence of y and t, along with a 21 

residue leader containing IQ contiguous His residues (tagged-x = 60.6 IdDa; tagged-y 
= 52.4 kDa for -2 fiameshift). The pETd>2aA^ plasmid was introduced into . 
BL21(DE3)pLysS cells harboring Ae gene encoding T7 RNA polymerase under . , 
control of the lac r^ressor. Log phase cells were induced with IPTG and analyzed 

20 before and after induction in an SDS polyacrylamide gel (Fig. 10, laneis 1 aiid 2). The 
result shows that upon induction, two new proteins are expressed with the 
approximate sizes expected of the TJh y and x subunits (larger than E, coli y, arid 
smaller than E, coli x). The two proteins are produced in nearly equal amounts, 
similar to the case of the\E. coli y and x subimits. Westem analysis using antibodies 

25 against the E. coli y and x subunits cross-reacted with the induced proteins further 
supporting their identity as TJh. y and x (data not shown, but repeated with the pure 
subunits shown in Fig. 10, lane 6). 



30 



EXAMPLES 

Purification of T.th. y and x 

The His-tagged TJh, y and x proteins were piirified fi-om 6 L of 
induced E. coli cells containing ihe pETdnaX plasmid. Cells were lysed, clarified 
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from cell debris by centrifugation and the supernatant was applied to a HiTrap chelate 
affinity column. Elution of the chelate affinity column yielded approximately 35 mg 
of protein in which the two predominant bands migrated in a region consistent with 
the molecular weight predicted from the dnaX gtne (Fig. 10, lane 3), and produced a 
5 positive signal by Western analysis vising polyclonal antibody directed against the E. 
coli y and t subunits (lane 4). The y and t subuiiits are present in nearly equal 
amoimts consistent with the nearly equal expression of these proteins in E. coli cells 
harboring the pET^na^plasmid. 

llie y and x subunits were further purified by gel filtration on a 

10 Superose 12 column (Fig. 10, lane 4; Fig. 11). Recovery of T.th. y and t subunits 

through gel filtration was 81%. The K coli y and x subunits, when separated from one 
another, elute during gel filtration as tetramers. A mixture of E. coli y/x results in a 
mixed tetramer of y2x2 along with y4 and x4 tetramers (Onrust et al., 1995). The 
mixture of TJh y/x elutes ahead of the 1 50 kDa marker, and thus is consistent with . 

15 the expected mass of a y2x2 tetramer (225 kDa) and y4 and x4 tetramers. 

As described earlier, the dnaX firameshifring sequence could produce 
either a -1 or -2 firamehift to yield a His-tagged y subunit of mass either 53.3 kDa or. 
52.4 kDa, respectively. The difference in these two possible products is too close to 
determine fix>m migration in SDS gels. It also remains possible that two y products 

20 are present and do not resolve vmder the conditions used. The exact protocol for this 
purificatioii is described below. 

Six liters of BL2 1 (DE3)pLy sSpET^/AioA^ cells were grown in LB media 
containing 50 ^g/ml ampicillin and 25 \ig/m\ chloramphenicol at 37°C to ah O.D. of 
0.8 and then IPTG was added to a concentration oif 2 mM. After a fiirther 2 h at ST^'C, 

25 cells were harvested by centrifiigation and stored at -70^C. The following steps were 
performed at 4^C. Cells (15 g wet weight) were thawed and resuspended in 4!5 ml IX 
bmding buffer (5 mM imidizole, 0,5 M NaCl, 20 mM Tris HCl (final pH 7.5)) using a 
dounce homogenizer to complete cell lysis and 450 ml of 5% polyamine P (Sigma) 
was added. Cell debris was removed by centrifugation at 18,000 rpm for 30 min. in a 

30 Sorvall SS24 rotor at 4*^C. The supematant (Fraction 1, 40 ml, 376 mg protein) was 
applied to a 5 ml HiTrap Chelating Separose column (Pharmacia-LKB). The colunm 
was washed with 25 ml of binding buffer, then with 30 ml of binding buffer 
containing 60 mM imidizole, and then eluted with 30 ml of 0.5 M imidizole, 0.5 M 
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NaCl, 20 mM Tiis-HCl (pH 7.5). Fractions of 1 ml were collected and analyzed on 
an 8% Coomassie Blue stained SDS polyacrylamide gel. Fractions containing 
subxmits migrating at the T,th y and t positions, and exhibiting cross reactivity with 
antibody to E, coli y and x in a Westem analysis, were pooled and dialyzed against 
5 buffer A (20 mM Tris-HCl (pH 7.5), 0.1 mM EDTA, 5 mM DTT and 1 0% glycerol) 
containing 0.5 M NaCl (Fraction II, 36 mg in 7 ml). Fraction II was diluted 2-fold 
. with buffer A and passed through a 2 ml ATP agarose column equilibrated in buffer A 
containing 0.2 M NaCl to remove any E. coli y complex contaminant. Then 0. 1 8 mg . 
(300 ml) Fraction II was gel filtered on a 24 ml Superose 12 column 

10 (Phannacia-LKB) in buffer A containing 0.5 M NaCl. After the first 216 drops, 

firactions of 200 pi were collected (Fraction III) and analyzed by Westem analysis (by 
procedures similar to those described in Example 6), by ATPase assays and by 
Coomassie Blue staining of an 8% Coomassie Blue stained SDS polyacrylamide gel. 
The Coomassie stained gels and Westem analysis of recombinEmt T,th, gamma and 

15 tau for these purification steps are summarized in Fig. 10. 

EXAMPLE 6 

Westem Analysis of T.th. cells for presence of v and t subunits 
20 Polyclonal antibody to E. coli y/x - E. coli y subunit was prepared as 

described (Studwell-Vaughan and O'Dbnnell, 1991). Pure y subunit (100 ^lg) was 
brought up in Freund's adjuvant and injected subcutanebusly into a New Zealand 
Rabbit (Poccono Rabbit Farms). After two weeks, a booster consisting of 50 |ag y in 
Frexmd's adjuvant was administered, followed after two weeks by a third injection (50 
25 ligy 

The homology between the amino terminal regions of TJh, and E. coli 
y/x subunits suggested that there may be some epitopes in common between them. 
Hence, polyclonal antibody directed against the E. coli y/x subunits was raised in 
rabbits for use in probing Tjh, cells by Westem analysis. Fig. 7 shows the results of a 
30 Westem analysis of whole T.tk cells lysed in SDS. The results show that in TJh. 

cells, the antibody is rather specific for two high molecular proteins which migrate in 
the vicinity of the molecular masses of E, coli y and x subimits. 
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Procedure for Western Analysis 

Samples were analyzed in duplicate 10 % SDS polyacrylamide gels by 
the Western method (Towbia et al. 1979). One gel was Coomassie stained to evaluate 
the pattern of proteins present, and the other gel was then electroblotted onto a 
5 nitrocellulose menibrane (Schleicher and Schuell). For molecular size markers, the 
kaliedoscope molecular wei^t markers (iBio-Rad) were used to verify by 
visualization that transfer of proteins onto the blotted membrane had occurred, llie 
gel used in electroblotting was also stained after electroblotting to confirm that 
efficient transfer of protein had occiured. Membranes were blockedl using 5% non-fat 

10 milk, washed with 0.05% Tween in TBS (TBS-l^ and then incubated for o 

with a 1/5000 dilution of rabbit polyclonal antibody directed against E, coli y and x in 
1 % gelatin in TBS-T at room temperature. Membranes were washed using TBiS-T 
buffer and then antibody was detected on X-ray film (Kodak) by using the ECL kit 
fi-om (Amersham) and the manufactures recconmiended procedures. 

15 Samples included: 1) a mixture of E. coli y {15 ng) and x (15 ng) 

subunits; 2) T.th. whole cellis (100 |iil) suspended in cracking buffer; and 3) purified 
r.r/i. y and T firaction n (0.6 |ig to a mixture). 

EXAMPLE 7 

20 

Characterization of the ATPase Activity of y/t 

The E. coli t subunit is a DNA dependent ATPase (Lee and Walker, 
1987; Tsuchihashi and Komberg, 1989). The y subunit binds ATP but does not 
hydrolyze it even in the presence of DNA unless other subunits of the DNA 

25 polymerase III holoenzyme are also present (Orirust et al., 1991). Next we examined 
the T.th. ylx subunits for DNA dependent ATPase activity. The y/x preparation was, 
in fact, a DNA stimulated ATPase (Fig. 1 1 , top panel). The specific activity of the 
T.th ylx was 1 1.5 mol ATP hydrolyzed/mol ylx (as monomer and assuming an equal 
mixture of the two). Furthermore, analysis of the gel filtration column fractions 

30 shows that the ATPase activity coelutes with the TJh ylx subimits, supporting 

evidence that the weak ATPase activity is intrinsic to the y/x subunits (Fig. 11). The 
specific activity of the y/x preparation before gel filtration was the same as after gel 
filtration (within 10%), further indicating that the DNA stimulated ATPase is an 
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. inherent activity of the y/x subunits. Presumably, only the x subunit contains ATPase 
activity, as iii the case of E, coli. Assuming only TJh x contains ATPase activity, its 
specific activity is tvsdce the observed rate (after factoring om the weight of y)^^ This 
rate is still only one-fifth that of E, coli x. 
5 The T,th y/x ATPase activity is lower at 37°C than at 65^C (middle 

panel), consistent with the expected behavior of protein activity from a thermophilic 
source. However, there is no apparent increase in activity in proceeding from 50**C to 
eS'^C (th€5 rapid breakdown of ATP aboye SS^'C precluded measurement of ATPase 
activity at temperatures above 65**C). In contrast, the E. coli x subunit lost most of its . 

10 ATPase activity upon (elevating the temperature to 50**C (middle panel). These 

reactions contain no stabilizers such as a nonionic detergent or gelatin, nor did they 
include substrates such as ATP, DNA or magnesixim. 

Last, the relative stability of TJh, y/x and £. coli y/x to addition of 
NaCl (Fig. 12,. bottom panel) was examined. Whereas the E. coli x subunit rapidly 

15 lost activity at even 0.2 M NaCl, the T.th y/x retained full activity m 1.0 M NaCl and 
was still 80 % active in 1 .5 M NaCl. The detailed procedure for the ATPase activity 
assay is described below. 

. ATPase assays 

20 ATPase assays were performed in 20 ^l of 20 mM Tris-HCl (pH 7.5), 

8 mM MgCh containing 0.72 ^g of M13mpl8 ssDNA (where indicated), 100 inM 
[y-^^P]-ATP (specific activity of 2000-4000 cpm/pmol), and the indicated protein. 
Some reactions contained additional NaCl where indicated. Reactions were incubated 
at the temperatures indicated in the figure legends for 30 min. and then were 

25 quenched with an equal volume of 25 mM EDTA (final). The aliquots were analyzed 
by spotting them (1 jil each) onto thin layer chromatography (TLC) sheets coated with 
Cel-300 polyethyleneimine (Brinkmaim Instruments Co.). TLC sheets were 
developed in 0,5 M lithium chloride, 1 M formic acid. An autoradiogram of the TLC 
chromatogram was used to visualize Pi at the solvent fi-ont and ATP near the origin 

30 which were then cut fi-om the TLC sheet and quantitated by liquid scintillation. The 
extent of ATP hydrolyzed was used to calculate the mol of Pi released per moil of 
protein per min. One mol of E. coli x was calculated assuming a mass of 71 kDa per 
monomer. The T.th y and t preparation was treated as an equal mixture and thus one 
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mole of protein as monomer was the average of the predicted masses of the y and x 
subunits (54 kDa). 

EXAMPLES 

Homolog of TJh, y/t to dnaX gene products of other organism 

The Xbal insert encoded an open reading frame, starting with a GTG 
codon, of 529 amino acids in length (158.0 kDa), closer to the predicted length of the 
B. subtilis X subunit (563 amino acids, 62.7 kDa mass)(Alonso et al., 1990) than the £1 
coli T subunit (71 . 1 kDa)(Yin et al., 1 986). The dnaX gene encoding the y/x subunits 
of coli DNA polymerase III holoenzyme is homologous to the holB g&ic encoding 
the 5' subimit of the y complex clamp loader, and this homology extends to all 5 
subvmits of the eukaryotic RFC clamp loader as well as the bacteriophage gene 
protein 44 of the gp44/62 clamp loading complex (O'Donnell et al., 1993). These 
gene products show greatest homology over the N-temiinal 166 amino acid residues 
(pfK coli dnaX)\ the C-temiinal regions are more divergent. Fig. 4 shows an 
alignment of the amino acid sequence of the N-temiinal regions of the TJh dnaX 
gene product to those of several other bacteria. The consensus GXXGXGKT (SEQ. 
ID. No. 1 7) motif for nucleotide binding is conserved in all these protein products. 
Further, the E. coli 8' crystal structure reveals one atom of zinc coordinated to four 
Cys residues (Guenther, 1996). These four Cys residues are conserved in the E. colt 
dnaXgenCy and the y and x subunits ericoded by R coli dnaX bind one atom of zinc. 
These Cys residues are also conserved in T.th. dnaX (shown in Fig; 4). Overall, the 
level of amino acid identity relative to E. coli dridXin the N-terminal 165 residues of 
T.th, dnaXis 53 %. Th^ T.th dnaX gene is just as homologous to the 5. subtilis dnaX 
(53 % identity) gene relative to JE. coli dnaX. After this region of homology, the 
C-terminal region of TJh dnaX shares 26% and 20% identity to E. coli and B, subtilis 
dnaX, respectiyely , A proline rich region, downstream of the conser/ed region, is 
also present ui TJh. dnoA" (residues 346075), but not in the B. subtilis dnaX(see Figs. 
3 A and 3B). The overall identity between E, coli dnaX and TJh dnaX oyer the entire 
gene is 34%. Identity of TJh. dnaX to B. subtilis dnaX over the entire gene is 28%. 
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Comparison of dnaX genes from TJh and E: coli 

The above identifies a homologue of the dnaXgene ofE., coli in 
Thermos thermophilus. Like the E. coli gene, TAh dnaX encodes two related proteins 
through use of a highly efficient translational frameshift. The TJh, y/x subunits are 
5 tetramers, or mixed tetramers, similar to the y and x subimits of E, coli. Further, the 
y/x subimit is a DNA stimulated ATPase like its E, coli covinterpart. As expected for 
proteins from a thermophile, the TjIl y/x ATTase activity is thermostabile and 
resistant to added salt. 

In E: coli, y i s a component of the clamp loader, and the x subunit 

10 serves the fimction of holding the clamp loading apparatus together with two DNA 
polymerases for coordinated rqjlicatipn of duplex DNA. The presence of y in 71 /fc 
suggests it has a clamp loading apparatus and thus a clamp as well. The presehce of 
. the X subunit of T th, implies that T th. contains a replicative polymerase with a 
structure similar to that of E. coli DNA polymerase III holoenzyme. 

15 A significant difference between E. coli and TJh. dnaX genes is in the 

translational frameshift sequence. In E, coli, the heptamer frameshift site contains six ^ 
A residues followed by a G residue in the context A AAA AAG. This sequence 
satisfies the X XXY YYZ rule for -1 frameshiftihg. The frameshift is made more 
efficient by the absence of the AAG tRNA for Lys which presumably leads to stallii^ 

20 of the ribosome at the frameshift site and increases the efficiency of frameshifUng 
(Tsuchihashi and Brown, 1992). Two additional aids to frameshifting include a 
downstream hairpin and an upstream Shine-Dalgamo sequence (Tsuchihashi arid 
Komberg, 1990; Larsen et al., 1994). The -1 frameshift leads to incorporation of one 
unique residue at the C-terminus of E, coli y before encounter with a stop codon. 

25 In T.th. , the dnaX frameshifring heptamer is A AAA AAA, and it is 

franked by two other A residues, one on each side. There is also a dovvmstream region 
of secondary structure. The nearest downstream stop codon is positioned such that 
gamma would contain only one unique amino acid, as in E, coli. However, the TJh. 
stop codon is in the -2 reading frame thus requires a -2 frameshift. No precedent 

30 exists in nature for -2 frameshifting, although -2 frameshifting has been shown to 
occur in test cases (Weiss et al., 1987). In vivo analysis of the T.th frameshift 
sequence shows that thi^ natural sequence promotes both -1 and -2 frameshifting in E. 
coli. Whereas the -2 frameshift results in only one unique. C-tefminal residue, a -1 
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firameshift wdvdd result in an extension of 12 C-tenninal residues. At pre%nt, the 
results do not discmninate which path occurs in r.//i., a -1 or -2 fraxneshift, or a 
combination of the two. 

There are two Shine-Dalgamo sequences just upstream of the 

S firameshift site in Tih dnaX. In two cases of firameshiftihg in E. colU an upstreanci 
Shine-Dalgamo sequence has been shown to stimulate fi:ameshifting (reviewed in 
Weiss et al., 1 897). In release factor 2 (RF2), the Shine-Dalgamo is 3 nucleotides 
upstream of the shift site, and it stimulates a +1 frameshift event. In the case of E. 
coll dnaX, a Shine-Dalgamo sequence 10 nucleotides upstream of the shift sequence 

10 istimulates the -1 fiameshift. One of the T.th. d«aA^ Shine-Dalgamo sequences is 

immediately adjacent to the frameshift sequence with no extra space, the other is 22 
residues upstream of the frameshift site. Which of these Shine-Dalgamo sequences 
plays a role in T.th dnaX frameshifting, if any, will require fiiture study. 

In jB. colU efficient separation of the two polypeptides, y and x, is 

15 achieved by mutation of the fiameshift site such that only one polypeptide is produced 
fix)m the gene (Tsuchihashi and Romberg, 1 990). Substitution of G-to-A in two 
positions of the heptamer of T.th. d>iaA!^ eliminates frameshifting and thus should be a 
source to obtain x subunit fi^e of y. To produce pure y subunit free of x, the 
frameshifting site and sequence immediately downstream of it can be substituted for 

20 an in-fi*ame sequence with a stop codon. 

Examination of the B. subtilis dnaX gene ishows no frameshift 
sequence that satisfies the X XXY YYZ rule. Hence, it would appear that dnaX docs 
not make two proteins in this gram positive organism. 

Rapid thermal motions associated with high temperature may make 

25 coordination of complicated processes more difficult. It seems possible that 
organizing the components of the replication apparatus may beconie yet more 
important at higher temperature. Hence, production of a x subtinit that could be used 
to crosslink two poljonerases and a clamp loader into one oirganized particlie may be 
most useftil at elevated temperature. 

30 As stated above, the following examples describe the continued 

isolation and purification of the substantial entirety of the Polymerase III from tiie 
extreme thermophile Thermus thermophilus. It is to be understood that the following 
exposition is reflective of the protocol and characteristics, both moqDhological and 
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functioncil, of the Polymerase Ill-type enzymes that are the focus of the present 
invention, and that the invention is hereby illustrated and comprehends the entire class 
of enzyines of thermophilic origin. 

5 EXAMPLE 9 

Purification of the Thermus thermophilus DNA polymerase III 

AU steps in the pimficatipn assay were performed at 4^C. The 
following assay was uised in the purification of DNA polymerase fit>m TJh cell 

10 extracts. Assays contained 2.5 mg activated calf thymus DNA (Sigma Chemical 
Company) in a final vbiume of 25 ml of 20 mM Tris-Cl (pH 7.5), 8 miM MgCh, 5 
mM DTT, 0.5 riiM EDTA, 40 mg/ml BSA, 4% glycerol, 0.5 mM ATP, 3 mM each 
dCTP, dGTP, dATP, aiid 20 mM [a-^^P]dTTP- An aliquot of the fiaction to be 
assayed was added to the assay niixture on ice followed by incubation at 60*'C for 5 

15 min. DNA synthesis was quantitated using DE81 paper followed by washing off 
imincorporated nucleotide. Incorporated nucleotide was determined by scintillation 
counting of the filters. 

Thermus thermophilus cell extracts were prepared by suspending 35 
grams of cell paste in 200 ml of 50 mM TRIS-HCl, pH=7.5, 30 mM spermidine, 100 

20 mM NaCi, 0.5 mM EDTA, 5 mM DTT, 5% glycerol, followed by disruption by 
passage trough a French pressure cell (1 5,000 PSI). Cell debris was removed by 
centrifiigatioh (12,000 RPM, 60 mih). DNA polymerase III in the clarified 
supernatant was precipitated by treatment with ammonium sulphate (0.226 gm/liter) 
and recovered by centrifugation. This fi-action was then backwashed with the same 

25 buffer (but lacking spenhidine) containing 0.20 gm/1 ainmoniiun sulfate. The pellet 
was then resuspendbd in buffer A and dialyzed overnight against 2 liters of buffer A; 
a precipitate which formed during dialysis was removed by centrifugation (17,000 
RPM, 20 min). 

The clarified dialysis supernatant, containing approximately 336 mg of 
30 protein, was applied onto a 60 ml heparin agarose colunin equilibrated in buffer A 
which was washed with the same buffer until A280 reached baseline. The colvimn 
was developed with a 500 ml linear gradient of buffer A fi-om 0 to 500 mM NaCl. 
More tightiy adhered proteins were washed off the column by treatment with buffer A 
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(20 mM Tris Hcl, pH - 7.5, 0.1 mM EDTA, 5mM DTT, and 10% glycerol) and IM 
NaCl. Some DNA polymerase activity flowed through the coliunn. Two peaks 
(HEP.P1 and HEP.P2) of DNA polymerase activity eluted from the heparin agarose 
colunm containing 20 mg and 2 mg of total protiein respectively (Fig. 13A). These 
S were kept separate throughout the remainder of the purification protocol. 

The Pol in resided in HEP .PI as indicated by the following criteria: 
1) Western anialysis using antibody directed against the a subunit of E. coli Pol III 
indicated presence of Pol III in HEP.Pl; 2) Only the HEP.Pl fraction was capable of 
extending a single primer around an M13mpl8 7.2 kb ssDNA circle (explained later 
10 in Example 1 6), such long primer extension being a characteristic of Pol III type 
enzymes; and 3) Only the HEP.P 1 provided DNA polymerase activity that was 
retained on an ATP-agarose affinity column, which is indicative of a Pol Ill-tyjpe 
DNA polymerase since the y and x subunits are ATP interactive proteins. 

The first peak of the heparin agarose colunm (HEP.Pl: 20 mg in 127.5. 
15 mi) was dialyzed jagainst bxiffer A and applied onto a 2ml N6-linkage ATP agarose 

column pre-equilibrated in the same buffer. Bound protein was eluted by a slow (0.05 
ml/mih) wash with biiflfer A + 2M NaCl and collected into 200 \il firactions. 
Chromatography of peak HEP.Pl yielded a flow-through (HEP.P1-ATP-FT) and a 
bound fraction (HEP.Pl-ATP-Bound) (Fig. 13B). Binding of peak HEP.P2 to the 
20 ATP column could not be detected, though DNA polymerase activity was recovered 
in the flow-through. 

The HEP .PI -ATP-Bound fractions from the ATP agarose 
chromatographic step were fiirther purified by anion exchange over monoQ. The 
HEP.P1 -ATP-Bound fractions were diluted wdth buffer A to approximately the 
25 conductivity of bxrffer A plus 25 mM NaCl and applied to a 1ml monoQ column 

equilibrated in Buffer A. DNA polymerase activity eluted in the flow-through and in 
two resolved chromatographic peaks (MONOQ peakl and peak2) (Fig. 13C). Peak 2 
was by far the major source of DNA polymerase activity. Westem analysis using 
rabbit antibody directed against the E. cali a subunit confirmed presence of the a 
30 subunit in the second peak (see the Westem analysis in Fig. 14B). Antibody against 
the E. call x subunit also confirmed the presence of the x subunit in the second peak. 
Some reaction against a and x was also present in the minor peak (first peak). The 
Coomassie Blue SDS polyacrylamide gel of the MonoQ fi-actions (Fig, 14A) showed 
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a band that co-migrated with E. coli a and was in ihc same postidn as the ^tibpdy 
reactive material (antibody against E. coli a). Also present are bands corresponding 
to t, y, 6, and 5'. These subunits, along with B, are all that is necessary for rapid and 
processive synthesis and primer extension over a long (> 7 kb) stretch of ssDNA in 
5 the case d/E. coli DNA Polymerase III holoenzyme. 

The Pol Ill-type enzyme purified from TJh. may be a Pol III*-like 
enzyme that contains the DNA polymerase and clamp loader subuits (i.e., like the Pol, 
HI* of E. coli). The evidence for this is: 1 ) the presence of dnaX and dnaE gene 
products in the same column fractions as indicated by Westem analysis (see above); 

10 2) the ability of this enzyme to extend a primer around a 7.2 kb circular sisDNA upoii 
adding only B (see Example 16); 3) stimulation of Pol III by adding B on linear DNA, 
indicating B subunit is not present in saturating ampunts (see Example 15); and 4) the 
presence of x in T.th. which may glue the polymerase and clamp loader into a Pol III* 
as in E. coli; and 5) the comigration of a with subunits x, y, 6 and 8' of the clamp 

15 loader in the column fractions of the last chromatographic step (MonoQ, Fig. 14A). 

Micro-sequencing of T. th DNA Polymerase III a subimit 

The a subunit from the purified 7! /A DNA polymerase in 
(HEP.P1 .ATP-Bouhd.MONOQ peak2) was blotted onto PVDF membrane and was 
20 cut out of the SDS-PAGE gel and submitted to the Protein-Nucleic Acid Facility at 
Rockefeller University for N-terminal sequencing and proteolytic digestion, 
purification and microsequencing of the resultant peptides. Analysis of the a 
candidate band (Mw l30kD) yielded four peptides, two of which (TTHl, TTH2) 
showed sequence similarity to a subunits from various bacterial sources (see Fig. 1 5). 
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EXAMPLE 10 



Identification of ihc Thermus tkermophilus dnaE gene encoding the a subunit nf 
DNA polymerase III replication enzyme 
30 Cloning of the dnaE gene was started with the sequence of the TTH l 

peptide from the purified a subunit (FFIEIQNHGLSEQK) (SEQ. ID. No. 61). The 
fragment was aligned to a region at approximately 1 80 amino acids downstream of 
the N-termini of several other known a subunits as shown in Fig. 15. The upstream 
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33mer f 5'-GT GGGATGC GTGGTTCTGGATCTCGATGAAGAA-3'V f SEQ. ID. 
No. 31) consists of a BamHI site within the first 9 nucleotides (underlined) and the 
sequence coding for the following peptide HGLSEQK on the complementary strand. . 
The downstream 29mer (5'-GT GGGATCCA GGGSCTSTCSGAGCAGAAG-3'V 
5 (SEQ. ID. No. 32) consists of a BamHI site within the first 9 nucleotides (vmderlined) 
. and the following sequence coding for the peptide FFIEIQNH (SEQ. ID. No. 62); 

These two primers were directed away from each other for the piirpose 
of perfoming inverse PGR (also called circular PGR). The amplification reactions 
contained 1 Ong r./Zt genomic DNA (that had heeh cut and religated with Xmal), 6.5 
10 mM of each primer, iii a volume of 100 ^1 of Vent polymerase reaction, mixture 
containing 10 \xl ThenrioPol Buffer, 0.5 mM of each dNTP and 0.25 mM MgS04. 
Amplification y/as performed using the following cycling scheme: 

1. 4 cycles of: 95.5**C - 30 sec., 45^C - 30 sec., 75^C - ^ rimL 

2. 6 cycles of: 95.5^C- 30 sec , SO^C - 30 sec., 75^C - 6 min. 
15 3.30 cycles of: 95.5°C - 30 sec, 52.5^C - 30 sec, 75^C - 5 min. 

A 1 .4kb fragment was obtained and cloned into pBS-SK:BamHI (i.e. pBS-SK 
(Stratragene) was cut with BamHI). This sequence was bracketted by the 29mer 
primer on both sides and contained the sequence coding for the N-terminal part of the 
subunit up to the peptide used for primer design. 

20 To obtain further dnaE gene sequence, the TTH2 peptide was used. It 

was aligned to a region about 600 amino acids from the N-tennini of the other known 
subunits(Fig: 15B). 

The upstream 34mer 
f5'-GC GGGATCCT CAACGAGGACCTCTCCATCTtCAA-3'^ fSEQ. ID. No ■ 

25 consists of a BamHI site within the first 9 nucleotides (xmderlined) and the sequence 
from the end of the fiagment previously obtained. The downstream 35mer 
(5'-GCGGGATCCTTGTCGTCSAGSGTSAGSGCGTCGTA-3') (SEQ. ID. No. 34) 
consists of a BamHI site within the first 9 nucleotides (imderlined) and the following 
sequence coding for the peptide YDALTLDD (SEQ. ID. No. 63) on the 

30 complementary strand. The amplification reactions contained 10 ng T./Zi: genomic 
DNA, 0.5 mM of each primer, in a volume of 100 |il of Vent polymerase reaction 
mixture containing 10 \i\ ThermoPol Buffer, 0.5 mM of each dNTP and 0.25 mM 
MgS04. Amplification was perfomied using the following cycling scheme: 
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1. 4 cycles of: 95.5^C - 30 sec, 45°C - 30 sec, 75^C - 8 min, 

2. 6 cycles of: 95.5^C - 30 sec, 50°C - 30 sec, 75^C - 6 min. 
3. 30 cycles of: 95.5°C - 30 sec., 55°C -r 30 sec., 75°C - 5 min. 

A 1 .2kb PCR fragment was obtained and cloned into pUC 1 9:BamHI. The fragment 
5 was bracketted by the downstream primer on both sides and contained the region 
overlapping in 56 bp with the fragment previously cloned. 

To obtain yet more dnaE sequence, the following primers were used. 
The upstream 39mer 

(3'-GTGTGGATCCTCGTCCCCCTCATGCGCGACGAGGAAGGG-5') (SEQ. ID. 
10 Nos. 35 and 1 14) consists of a BamHI site within the first 10 nucleotides (underlined) 
and the sequence from the end of the fragment previously obtained. The downstream 
27mer (5'-GTG TGGATCC TTCTTCTTSCCCATSGC-3'^ (SEQ. ID. No. 36) consists 
of a BamHI site within the first 10 nucleotides (underlined), and the sequence coding 
for the peptide AMGKKK (SEQ. ID. No. 64) (at position approximately 800 residues 
15 from the N terminus) on the complementary strand. The AMGKKK (SEQ. ID. 

No. 64) sequence was chosen for primer design as it is highly conserved among the 
known gram-negative a subunits. The amplification reactions contained 10 ng TJh 
genomic DNA, 0.5 mM of each primer, in a voliime of 100 |il of Taq polymerase 
reaction xnixture containing 10 (il PCR Buffer, 0.5 mM of each dNTP and 2,5 mM 
20 MgCb. Amplification was performed using the following cycling scheme: 
1.3 cycles of: 95.5^*0 - 30 sec, 45**C - 30 sec, 72**C - 8 min. 

2. 6 cycles of: ?4.5^C - 30 sec, 55°C - 30 sec, 72**C - 6 min. 

3. 32 cycles of: 94.5^C - 30 sec, 50^C - 30 sec, 72^C - 5 min. 

A 2.3kb PCR fi-a^ent was obtained instead of the expected 0.6 kb firagment. BamHI 
25 digestion of the PCR product resuhed in three fi-agments of 1 . 1 kb, 0.7kb and O.Skb. 
The 1 . 1 kb fingment was cloned into pUC 1 9:BamHI. It turned out to be the one 
adjacent to the firagment previously obtained and contained the dnaE sequence right 
up to the region coding for the AMGKKK (SEQ. ID. No. 64) peptide, but was 
disrupted by an intron just upstream of this region. The sequence that follows this 
30 was amplified from the 2.3kb original PCR product using the same conditions and 

cycling scheme as for the 2.3kb fragment. The downstream primer was the same as in 
the previous step. The upstream 27mer 

(3'-GTGTGGATCCGTGGTGACCTTAGCCAC-5') (SEQ. ID. Nos. 37 and 115) 
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consisted of a BaihHI site within the first 9 nucleotides (underlined) and the sequence 
from the end of the 1 . Ikb fragment previously described. 

The expected 1 .2kb PGR fragment was obtained and cloned into 
pUC19:Smai. This fra^gment coded for the rest of the intein and the end of it was used 
5. to obtain the next sequence of ^//2a£ downstream of this region. The upstream 3 Omer 
(3'.TTCGTGTCCGAGGACCTTGTGGTCCACAAC-5') (SEQ. ID. Nos. 38 and 1 16) 
was a sequence from the end of the infron. The downstream 23mer 
(5'-CCAGAATCGTCTGCTGGTCGTAG-30 (SEQ. ID. No. 39) was the 
from the end of the dhaE gene of D,rad. (coding on the complementaiy strand for the 

1 0 region sli^tly homologous in the distantly related a subunits and possibly highly 
homologous between T.th, and D.rad. a subunits). The amplification reactions 
contained 1 0 ng J. tk genomic DN A, 0.5 mM of each primer, in a volume of 1 00 nl of 
Vent polymerase reaction mixture containing 10 \xl ThermoPol Buffer, 0.5 mM of 
each dNTP and 0.1 mM MgS04. Amplification was performed using the following 

15 cycling scheme: 

1.3 cycles of: 95.5^C - 30 sec, 55**C - 30 sec,, 75^C - 8 min. 
2. 32 cycles of: 94.5**C - 30 sec., 50^C - 30 sec., 75*^0 - 5 min. 
A 2.5kb PGR fiagment was obtained and cloned into pUCl 9:SmaI. This firagment 
contained the dnaE sequence coding for the 300 mino acids next to the AMGKKK 
20 (SEQ. ID. No. 64) region disrupted by yet a second intein inside another sequence 
that is conserved among the known a subimits (FNKSHS AA Y) (SEQ. ID. No. 65). 

To obtain the rest of the dnaE gene the uplstream 1 9mer 
(5'-AGCACCCTGGAGGAGCTTC-30 (SEQ. ID. No. 40) from the end of the known 
dnaE sequence was used. The downstream primer was: 
25 5'-CATGTCGTACTGGGTGTAC-3' (SEQ. ID. No. 41). The amplification reactions 
contained 10 ng T.th genomic DNA, 0.5 mM of each primer, in a volimie of 100 |il of 
Vent polymerase reaction mixture containing 10 ^1 ThermoPol Buffer, 0.5 mM of 
each dNTP and 0,1 mM MgS04. Amplification was performed using the following 
cycling scheme: 

30 1.3 cycles of: 95.5^G - 30 sec, 55°C - 30 sec, 75°C - 8 min. 

2. 32 cycles of: 94.5^0 - 30 sec, 50^C - 30 sec, 75^C - 5 min. 
A 1 .Okb fragment bracketed by this upstream primer was obtained. It contained the 3' 
end of Hie dnaE gene. 
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EXAMPLEll 

Cloning and Expressi on of the Thermus thermovhilus dnaO gene encoding the g 
5 subunit of DNA polymerase III replication enzyme 

Cloning of dnaO 

The dnaQ gene of K coli and the corresponding region of PolC of B. 

subtilis, evolutionary divergent organisms, share approximately 30% identity. 
1 0 Comparison of the predicted amino acid sequences for DnaQ (e) ofE. coli and PolC 

of Bsubtilis revealed two highly conserved regions (Fig. 17). Within eath of these 

regions, a nine amino acid sequence was used to design two oligonucleoti4e primers 

for use in the polymerase chiain reaction. 

The regions highly conservative among Pol III exonucleases were 
i s chosen to design the degenerate primers for the amplification of a T ih dnaQ internal 

fragment (see Fig. 17). DNA oligonucleotides for amplification of TJh, jgenomic 

DNA weie as follows. The upstream 27mer 

(5'-GTSGTSNNSOACNNSGAGACSACSGGG-3' (SEQ. ID. No. 42)) encodes the 
following sequence (VyXDXETTG) (SEQ. ID. No. 66). The downstream 27mer 

20 (5'-GAASCCSNNGTCGAASNNGGCGTTGTG.3') (SEQ. ID. No. 43) encodes the 
sequence HNAXFDXGF (SEQ. ID. No. 67) on the complementary strand. The 
amplification reactions contained 10 ng T.th. genomic DNA, 0.5 mM of each primer, 
in a volume of 1 00 ^il of Vent polymerase reaction mixture containing 1 0 nl 
ThermoPol Buffer, 0.5 mM of each dNTP and 0.5 mM MgS04. Amplification was 

25 perfonned iising the following cycling scheme: 

1. 5 cycles of: 95,5^C - 30 sec, 40°C - 30 sec, 72^C - 2 min. 

2. 5 cycles of: 95.5^C ^ 30 sec, 45^C -^ 30 sec, 72**C - 2 mm. 

3. 30 cycles of: 95.5**C - 30 sec, 50^C - 30 sec, 72^C - 30 min. 
Products were visualized in a 1 .5 % native agarose gel. A fragment of the expected 

30 size of 270 bp was cloned into the Smal site of pUCl 9 and sequenced with the 

Circumvent Thermal Cycle DNA sequencing kit accordinig to the manufacturer's 
instructions (New England Biolabs). 
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To obtain further sequence of the dnaQ gene, genomic DNA was 
digested with either mhol, BamHIi Kpnl or Ncol. These restriction enzymes were 
chosen because they cut T.th. genomic DNA frequently. Approximately 0.1 |ig of 
DNA for each digest was ligated by T4 DNA ligase in 50 ^il of ligation buffer (50 
mM Tris-HCl (pH 7.8), 10 mM MgCb, 10 mM dithiothreitol, 1 mM ATP, 25 mg/ml 
bovine serum albumin) ovemight at 20°C. The ligation mixtures were used for 
cicularPCR. 

DNA oligonucleotides for amplification of T. th. genomic DNA were 
the following. The upstream 27mer 

(5'.CGGGGMCCACCTCAATCACCTCGTGG-3') (SEQ. ID, No. 44) consists of a 
BamHI site within the first 9 nucleotides (underlined) and the sequence 
complementary to 42-61 bp region of the previously cloned ^na^ fragment. The 
downstream 3Qmer (5'-iGGGGG^TCCGCCACCTTGCGGCTCCGGGTG-3') (SEQ. 
ID. No. 45) consists of a BamHI site within the first 9 nucleotides (underlined) and 
the sequence corresponding to 240-261 bp region of the dnaQ fragment (see Fig. 17). 

The amplification reactions contained 1 ng T.th. genomic DNA (that 
had been cut with Ncol and religated into circular DNA for circular PGR), 0.4 mM of 
each primer, in a volume of 100 fil of Vent polymerase reaction mixture containing 
10 ^l ThermoPol Buffer, 0.5 mM of each dNTP, 0.5 mM MgS04, and 10% DMSO. 
Circular amplification was performed using the following cycling scheme: 

1.5 cycles of: 95.5°C - 30 sec, 50*^0 - 30 sec, 72^C - 8 min. 

2. 35 cycles of: 95.5^C - 30 sec, 55°C - 30 sec, 72^C - 6 min. 

3.72^C-10min. 

A 1.5 kb fragment was obtained arid cloned into the BamHI site of the pUC19 vector. 
Partial sequencing of the fragment revelled that it contained the dnaQ regions 
adjacent to sequences correspondinjg to the PGR primers and hence contained the 
sequences both upstream and downstream of the previously cloned dnaQ fragment. 
One of Ncol sites turned out to be approximatly 300 bp downstream of the end of the 
first cloned dnaQ sequence and hence did not include the 3' end of dnaQ. To obtain 
the 3' end, another inverse PGR reaction was performed. Since an Apal restiction site; 
was recognized within this newly sequenced dnaQ fragment, the circular PGR 
procedure was performed using as template an Apal digest of T.th. genomic DNA that 
was ligated (circularized) under the same conditions as described above. 
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DNA oligonucleotides for amplification of the Apal/religated Tjh 
genomic DNA were as follows. The upstream 3 Imer 

(5'-GCGC TCTAGA CGAGTTCCCAAAGCGTGGGGT-3') (SEQ. ID. No, 46) 
consists of a mbal site within the first 10 nucleotides (underlined) and the sequence 
5 complenientary to the region downstream of the Apal restriction site in the newly 
sequenced fiagment. The downstream 25 mer 

(5'-CGCGlCIAGATCACCTGTATCCAGA-3') (SEQ. ID. No. 47) consists of a 

Xbal site within the first 10 nucleotides (underlined) and the sequence corresponding 

to another region downstream of the Apal restriction site in the newly sequenced 
1 0 dnaQ firagment. The 1.7 kb PGR fi:agment was cloned mto the Xbal site; of the 

pUC19 vector and partially sequenced. The sequence of ^^aig, and the protein 

sequence of the e subimit encoded by it, is shown in Fig. 18. 

The dnaQ geine is encoded by an open reading fi^e of 209 (or 190 

de]pending on which Val is used ais the initiating residue) amino acids in length 
15 (23598.5 kDa - or 21383.8 kDa for shorter version), similar to the length of the E, coli 

e subunit (243 amino acids, 27099.1 kDa mass) (see Fig. 17). 

The entire amino acid sequence of the 8 subimit predicted fi-om the 

TJh. dnaQ gene aligns with the predicted amino acid sequence of the dnaQ genes of 

other organisms with only a few gaps and insertions (the first two amino acids, and 
20 four positions downstream) (Fig. 17). The consensus motifs VVXDXEJTG (SEQ. 

ID. Nos. 66 and 68), HNAXFDXGF (SEQ. ID. No. 67), and HRALYD (SEQ. ID. 

No. 70), characteristic for exonucleases, are conserved. Overall, the level of aihino 
. acid identity relative to most of the known z subunits, or corresponding proofi'eading 

-exonuclease domains of gram positive PolC genes is approximately 30%. Upstream 
25 of start 1 (Fig. 1 7) there were stop codons in all three reading firames. 

Expression of dnaO 

The dnaQ gene was cloned gene into the pET24-a expression vector in 
two steps. First, the PGR fi^agment encoding the N-terminal part of the gene was 
30 cloned into the pUCl 9 plasmid, containing the Apal inverse PGR fragment into 

Ndel/Apal sites. DNA oligonucleotides for amplification of T.th genoniic DNA were 
as follows. The upstream 3 3mer 

(5*-GCGGCGCATMGGTGGTGGTCCTGGACCTGGAG-3') (SEQ. ID. No. 48) 
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consists of an Ndel site within the first 12 nucleotides (underlined) and the begining 
of the ^/2a(2 gene. The downstream 25 mer 

(5'-CGCGTCTAGATCACCTGTATCCAGA-30 (SEQ. ID. No. 49), already used for 
Apal circular PGR, consists of an Xbal site within the first 1 0 nucleotides 
5 (underlined) and the sequence corresponding to the region downstream of the Apal 
restriction site: The 2.2 kb Ndel/Sall fragment was then cloned into the Ndel/Xhol 
sites of the pET16 vector to produce pETZA-siiclnaQ. The e subvinit was e^qpressed in 
the BL2 1 /LysS strain transformed by the pET24-a:dpiag plasmid. 

10 EXAMPLE 12 

The Thermus thermophilus dnaN aene encoding the fi subunit of DNA TX)lymerase III 
replication enzvme 

15 Strategy of cloning dnoN hy use of dnoA 

DnaN proteins are highly divergent in bacteria making it difficult to 
clone them by homology. The level of identity between DnaN representatives firom 
coli and 5. subtilis is as low as 1 8%. These 1 8% of identical amino acid residues 
are dispersed through the proteins rather then clustering together in conservative 

20 regions, further complicating use of homology to design PGR primers. However, one 
feature of dnaN genes among widely different bacteria is their location in the 
chromosome. They appear to be near the origin, and immediately adjacent to the 
dnoA gene. The ^aA genes show good homology among different bacteria and, thus, 
dnoA was first cloned in order to obtain a DNA probe that is likely near dnaN. 

25 

Identification of dnoA and dnaN 

The dnoA genes of E, coli and A subtilis share 58% identity at the 
amino acid sequence level within the ATP-binding domain (or among the 
representatives of gram-positive and gram-negative bacteria, evolutionary divergent 
30 organisms). Comparison of the predicted amino acid sequences encoded by dnoA of 
E. coli and B. subtilis revealed two highly conserved regions (Fig. 19). Within each 
of these regions, a seven amino acid sequence was used to design two oligonucleotide 
primers for use in the polymerase chain reaction. The DNA oligonucleotides for 
amplification of TJh genomic DNA were as follows. The upstream 20mer 
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(5'-GTSCTSGTSAAGACSCACTT-3') (SEQ. ID. No. 50) encodes the following 
sequence: VLVKTHL (SEQ. ID. No. 69). The downstream 21 mer 
(5'-SAGSAGSGCGTTGAASGTGTG-3', where S is G or C) (SEQ. ID. No. 51) 
encodes the sequence: HTFNALL (SEQ. ID. No. 71), on the complementary strand. 
5 The amplification reactions contained 1 0 ng T.th. genomic DNA, 0.5 mM of each 
primer, in a volume of 100 ^l of Vent polymerase reaction mixture containing 10 jil . 
ThermoPol Bufifer, 0.5 mM of each dNTP and 0.5 mM MgS04. Amplification was 
performed using the following cycling scheme: 

1.5 cycles of: 95.5*C - 30 sec., 45°C - 30 sec, 75''C - 2 min. 

10 2. 5 cycles of: 95.5°C - 30 sec, 50°C - 30 sec, 75»C - 2 min. 

3. 30 cycles of: 95.5»C - 30 sec, 52'»C - 30 sec, 75°C - 30 min. 
Products were visualized in a 1 .5% native agarose gel. A fragment of the expected 
size of 300 bp was cloned into the Smal site of pUCl 9 and sequenced with the 
CircimiVent Thermal Cycle DNA sequencing kit (New England Biolabs). 

15 To obtain a larger section of the T. th. dnaA gene, genomic DNA was 

digested with either Haell, HindHI, KasI, Kpnl, Mlul, Ncol, NgoMI, Nhel, Nsil, 
PaeRTI, PstI, SacI, SaU, Spel, SphI, StuI, or Xhol, followed by Southern analysis in a 
native agarose gel. The filter was probed with tiie 300 bp PGR product radiolabeled 
by random priming. Four different restriction digests showed a single fiiagment of 

20 reasonable size for further cloning. These were, KasI, NgoMI, and StuI, all of which 
produced $:agments of about 3 kb, and Ncol that jnoduced a 2kb fi-agment. Also, a 
Kpnl digest resulted in two fi^gments of about 1 .5 kb and 1 0 kb. 

Genoinic DNA digests using either NgoMI and StuI were used to 
obtain the dnoA gene by inverse PGR (also referred to as circular PGR). In this 

25 procedure, 0.1 ng of DNA from each digest was treated separately with T4 DNA 
ligase in 50 |il of ligation buffer (50 mM Tris-HCl (pH 7.8), 10 mM MgCb, 10 mM 
dithiothreitol, 1 mM ATP, 25 mg/ml bovine serum albumin) overnight at 20''C. This 
results in cncularizing the genomic DNA fiagments. The ligation mixtures were used 
as substrate in inverse PGR. 

30 DNA oligonucleotides for amplification of recircularized T.th. 

genomic DNA were as follows. The upstream 22mer was 

(5'-CTCGTTGGTGAAAGTTTCCGTG-3') (SEQ. ID. No. 52), and the downstream 
24mer was (5'-CGTCCAGTTCATCGCCGGAAAGGA-3') (SEQ. ID. No. 53). The 
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amplification reactions contained 5 ng Tth genomic DNA, 0.5 \iM of each primer, in 
a volimie of 100 ^il of Taq polymerase reaction mixture containing \0 \x\ PGR Buffer, 
0.5 mM of each dNTP and 2.5 mM MgCb. Amplification was performed using the 
following cycling scheme: 
5 1. 5 cycles of: 95.0°C - 30 sec, 55^C - 30 sec, 72*^0 - 10 min. 

2. 35 cycles of: 95.5°C - 30 sec, 50°C - 30 sec, 72°C - 8 min. 
The PGR fiagments of the expected length for NgoMI and StuI treated and then 
ligated chromosomal DNA were digested with either BamHI or Sau3a and cloned into 
pUG19:BamHI and pUG19:(BamHI+SmaI) and sequenced with GircumVent Thermal 

1 0 Gycle DNA sequencing kit. The 1 .6kb (BamHI+BamH) fragment from the NgoMI 
PGR product contained a sequence coding for the N-terminal part of dndN^ followed 
by the gene for enolase. The 1 kb (Sau3a+Sau3a) fragment from the saihe PGR 
product included the stiart of dnaN gene and sequence characteristic of the origin of 
replication (i.e., 9mer DnaA-binding site sequences). The 0.6kb (BamHI+BamHI) 

15 fragment from the StuI PGR reaction contained starts for dnoA and gidA genes in 

inverse orientation to each other. The 0.4 kb (Sau3a-i-Sau3a) fragment from the same 
PGR product contained the 3 * end of the dnoA gene and DNA sequence characteristic 
for the origin of replication. 

This sequence information provided the beginning and end of both the 

20 dnoA and the dnaN genes. Hence, these genes were easily cloned from this 

information. Further, the dnaN gene was readily cloned and expressed in a pET24-a 
vector. These steps are described below. 

Gloning and sequence of the dnoA gene 
25 The dnoA gene was cloned for sequencing in two parts: from the 

potential start of the gene up to its middle and from the middle up to the end. For the 
N-terminal part, the upstream 27mer 

(5'.TGTGGGAAGAGGTTGTGGAGCAGATGG-3') (SEQ. ID. No. 54) was 20 bp 
downsteam of the potential start codon of the gene. The downstream 23mer 
30 (5'-TGCTGGCGTTCATCTTCAGGATG-30 (SEQ. ID. No. 55) was approximately 
from the middle of the dnoA gene. For the C-terminal part, the upstream 23mer 
(5'.GATCCTGAAGATGAAGGGGAGGA-3') (SEQ. ID. No. 56) was complementary 
to the previous primer. The downstream 25mer 
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(5'-AGGTTATCCACAGGGCjTCATGTGCA-3') (SEQ. id. No. 57) was 20 bp 
upstream the potential stop codon for the dnaA gene. The amplification reactions 
contained 10 ng TJh, genomic DNA, 0.5 ^iM of each primer, in a volume of 100 \xl of 
Vent polymerase reaction mixture containing \0 \xl ThermoPol Buffer, 0.5 mM of 
5 each dNTP and 0.5 mM MgS04. Amplification was performed using the following 
cycling scheme: 

1. 5 cycles of: 95.5**C - 30 sec., 55^C - 30 sec, 75°C - 3 min. 

2: 30 cycles of: 95.5**C - 30 sec., 50^C - 30 sec., 75^G ~ 2 mShi. 
Products were visualized in a 1 .0% native agarose gel. Fragments of the ejq>ected 
10 sizes of 750 bp and 650 bp were produced, and were sequenced using GircumVent 

Thennal Cycle DNA sequencing method (New England Biolabs). The nucleotide and 
amino acid sequences of dnoA and its protein product are shown in Fig. 20. Ilie 
DnaA protein is homologouis to the DnaA proteins of several other bacteria as shown 
in Fig. 19. 

15 

Cloning and expression of dnaN 

The full length dnaN gene was obtained by PCR from T th total DNA. 
DNA oligonucleotides for amplification of T.th dnaN were the following: the 
upstream 29mer (5'-GTGTGTCATATGAACATAACGGTTCCCAA-3') (SEQ. ID. 

20 No. 58) consists of an Ndel site within first 1 1 nucleotides (xmderlined), followed by 
the sequence for the istart of the dnaN gene; the downstream 29mer 
f5'-GCGC GAATTCT CCCTTGTGGAAGGCTTAG-3') (SEQ. ID. No. 59) consists of 
an EcoRI site within the first 10 nucleotides (underlined), followed by the sequence 
complementary tb a section just downstream of the dnaN stop codon. The 

25 amplification reactions contained 10 ng T,th genomic DNA, 0.5 \iM of each primer, 
in ia volume of 1 00 nl of Vent polymerase reaction mixture containing 1 0 (il 
Thermopol Buffer, 0.5 mM of each dNTP and 0.2 mM MgS04. Amplification was 
performed using the following cycling scheme: 

1. 5 cycles of: 95.0^C - 30 sec, 55°C - 30 sec, 75°C ~ 5 min. 

30 2. 35 cycles of: 95.5°C - 30 sec, 50°C - 30 sec, 75°C - 4 min. 

The nucleotide and anlino acid sequences of dnaN and the 6 subvmit, respectively, are 
shown in Fig. 21 . The Tth, B subunit shows limited homology to the fi subimit 
sequences of several other bacteria over its entire length (Fig. 22). 
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The approximately 1 kb dnaN gene was cloned into the pET24-a 
expression vector using the Ndel and EcoRI restriction sites both in ih&dnaN 
containing PGR product and in pEt24-a (Fig. 23). Expression of Tth B subunit was 
obtained under the following conditions: a fresh colony of B121(DE3) Kcoli strain 
5 was transformed by the pET24-a:dnaN plasmid, and then was grown in LB broth 

containing 50 mig/ml kanamycin at 3TC until the cell density reached 0.4 ODeoo. 'Hie 
cell culture was then induced for dnoN expression upon addition of 2 mM IPTG. 
Cells were harvested after 4 additional hours of growth under 37®C. The induction of 
the TJh fi subunit is shown in Fig. 24. 

1 0 Two liters of BL2 1 (DE3)pET JnaT/cells were grown in LB media 

containing 50 mg/ml ampicillin at 3T'C to an O.D. of 0.8 and then IPTG was added to 
a concentration of 2 mM. After a further 2 h at 3 7^C, cells were harvested by 
centriftigation and stored at -70^C. The following steps were performed at 4**C. Cells 
were thawed and resuspended in 40 ml of 5 mM Tris-HCl (pH 8.0), 1% sucrose, IM 

15 NaCl, 5 mM DTT, and 30 mM spermidine. Cells were lysed using a French Pressure 
cell at 20,000 psL The lysate was allowed to sit at 4*'C for 30 min. and then cell 
debris wais removed by centriftigation (Sorvall SS-34 rotor, 45 min. 1 8,000 rpm). The 
supernatant was incubated at 65**C for 20 minutes with occasional stirring. The 
resulting protein precipitate was removed by centriftigation as described above. The 

20 supernatant was dialyzed against 4 liters of buffer A containing 50 mM NaCl 

overnight. The dialyzed supernatant was clarified by centriftigation (35 ml, 150 mg 
total) and then loaded onto an 8 ml MonoQ column equilibrated in buffer A 
containing 50 mM NaCl. The column was washed with 5 column volumes of the 
same buffer and then eluted with a 120 ml gradient of buffer A plus 50 mM NaCl to 

25 buffer A plus 500 mM NaCl. Fractions of 2 ml were collected. Over 50 mg of TJh B 
was recovered in fractions 5-21. 
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EXAMPLE 13 

Identification and cloning of T. thermophilus holA 

5 A search of the incomplete T.th. genome database (www.g21.bio.mii- 

goettingen.de) showed a match to E. coli S encoded by holA. The sequence obtained 
fi-om the database was as follows (SEQ. ID. No. 185): 

TPKGKPLVRHLHsfRAKIU.GLRLPGGVAQYLA-SLEGDLEALERELEKLALLSP 
10 -PLTLEKVEKWALRPPLTGFDLVRSVLEKDPKEALLRLGRLKEEGEEPLRLL 
GALSWQFAIXARAFFIXREMPRPKEEDLARLEAHPYAAKKALL-EAAR^ 
EALKEALDALMEAEKRAKG-GKDPWLALEAAVLRLAR-PAGQPRVD 

Next, flie following PCR primers were designed fix>m the codon usage 
15 of r.r/i.: upstream 27mer (5*- GCC CAG TAG CTC GCC TCC CTC GAG GGG -3') 
(SEQ. ID. No. 1 86) and downstream 27mer (5'- GGC CCC CTT GGC CTT CTC 
GGC CTC CAT -3' (SEQ. ID. No. 1 87) to obtain a partial holA nucleotide sequence 
(SEQ. ID. No. 188): 

20 AGACTCGAGG CCCTGGAGCG GGAGCTGGA6 AAGCTTGCCC TCCTCTCCCC ACCCCTCACC 60 
CTGGAGAAGG TGGAGAAGGT GGTGGCCCTG AGGCCCCCCC TCACGGGCTT TGACCTGGTG 120 
CGCTCCGTCC TGGA6AAGGA CCCCAAGGAG GCCCTCCT6C GCCTCA06CG CCTCAGG6AG 180 
GAG66GGAG6 AGCCCCTCAS GCTCCTCX3GG GCCCTCTCCT GGCyVGTTOGC CCTCCTCGCC 240 
OSGGCCTTCT TCCTCCTCC6 GGAAAACCCC AGGCCCAAG6 AGGAGGACCT CX3COCGCCTC 300 

25 GAGGCCCACC CCTACGCCX3C CAAGAAGGCC A .331 

This sequence codes for a partial amino acid sequence of the T. th. 8 
subunit (SEQ. ID. No. 189): 

30 RLEALERELEKLALLSPPLTLEKVEKWALRPPLTGFDLVRSVLEKDPKEALL 
RLRRLREEGEEPLRLLGALSWQFALLARAFFLLRENPRPKEEDLARLEAHPYA 
AKKA 



35 



The DNA sequence obtained by PCR (SEQ. ID. No. 1 88) was used to 
design internal primers for inverted PCR. The upstream 3 1 mer (5'- 
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GTGGTGTCTAGACATCATAACGGTTCTGGCA-3') (SEQ. ID, NO. 190) 
introduced an Xbal site for cloning holA into a pGEX vector. The downstreani 27mer 
(5*-GAGGGCCACCACCTTCTCCACCTTCTC-3*) (SEQ. ID. No. 191) encodes holA 
sequence EKVEKWAL (aa residues 1 59-1 67 of SEQ. ID. No. 1 58) on the 
5 complementary strand. The amplification reactions contained 50ng T.th genomic 
DNA and 0.1 \M of each primer in a volume of 100|li1 of Vent polymerase reaction 
mixture contaming 10 |il ThermoPol Buffer, 2.5 mM of each dNTP, 2 mM MgS04 , 
and 1 0 i^l of formamide. Amplification was performed using the foUovsdng cycling 
scheme: 

10 .1.5 cycles of: 95**C - 30 sec, 65**C - 20 sec, 75°C - 5 min. 

2. 5 cycles of: 95°C - 20 sec, 58°C - 10 sec, 75°C - 5 min, 

3. 35 cycles of: 95°C - 20 sec, 50^C - 5 sec, 75°C - 4 min. 
Products were visualized in a 1 .0% native agarose gel. A fragment of 1 .5 Kb was gel 
purified and partially sequenced. 

15 A different set of primers were used to obtain the 3'-end of TJh. holA^ 

mcluding an upstream 25mer (5'-GTCCGTCCTGGAGAAGGACCCCAAG-3') 
(SEQ. ID. No. 192) which encoded the amino acid sequence SVLEKDPK from TJh. 
holA (aa residues 179-186 of SEQ. ID. No. 158), and a downstream 29mer (5 - 
CGCGAATTCAACGCSCTCCTCAAGACSCT-3' where S = C or G) (SEQ. ID. No. 

20 193) was not related to the holA sequence. The amplification reactions contained 
50ng TJh genomic DNA and 0.1 |iM of each primer in a volunie of 100 \i\ of Vent 
polymerase reaction mixture containing 10 ^il ThermoPol Buffer, 2.5 mM of each 
dNTP, and 1-2 mM MgS04 , and 10 ^1 of formamide. Amplification was performed 
using the following cycling scheme: 

25 1.5 cycles of: 95°C - 30 sec, 65^C - 20 sec, 75^0 - 5 min. 

2. 5 cycles of: 95^C - 20 sec, 55°C - 10 sec, 75**C - 5 min. 
3.35 cycles of: 95°C - 20 sec, 50°C - 5 sec, 75**C - 4 min. 
Products were visualized in a 1 .0% native agarose gel. A Augment of 1 .2 Kb was gel 
purified and partially sequenced to obtain the remainder of the TJh holA gene. 

30 The Z th holA gene was cloned into the Ndel/EcoRI sites in the pET24 

vector using a pair of primers. The upstream 3 Imer (5'- 

GACACTTAACATATGGTCATCGCCTTCACCG-3') (SEQ. ID. No. 194) contains 
a Ndel site within the first 1 5 nucleotides (underlined) and has a sequence 
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corresponding to 5' region of TJh holA. The downstream 38 mer (5- 
GTGTGTieMlICGGGTCAACGGGCGAGGCGGA 

No. 195) contains a EcoRI site within the first 12 nucleotides (underlined) and has a 
sequence complementary to the 3' end of gene. 

5 • ' • * " 

EXAMPLE 14 
Identification of Tth. holB encoding 8' subunit 

10 

To clone the ends of T. th holB gene, it was assumed that the order of. 
genes in 7%ermiij //i^rmo/7M/^ could be the same as in related Z)^ 
radiodurance. Multiple alignment of the upstream neighbor (probable 
phosphoesterase, DNA repair Rad24c related protein) revealed a conservative region 
15 close to the C-terminus of the protein sequence: 

Deinococcus radiodurance VI IjNPGS VGQ 
Methanococcus janaschii YL INPGS VGQ 
Thermotoga maritima LVLNPCSAGR 

20 

The D.rad. sequence was used to design an upstream 28mer primer (5 - 
CTGGTGAACCCGGGCTCCGTGGGCCAGC-3') (SEQ. ID. No. 199) that encodes 
the amino acid sequence LLVNPGSVGQ (SEQ. ID. No. 200) and a downstream 
27mer (5'.CTCGAGGAGCTTGAGGAGGGTGTTGGC-3') (SEQ. ID:No. 201) 
25 encodes the sequence ANTLLKLLE (SEQ. ID. No. 202) on the complementary 

strand. The amplification reactions contained 50ng T.th, genomic DNA and 6.1 jiM 
of each primer in a volume of 1 OQ\x\ of Deep Vent polymerase reaction mixture 
containing 10 ^l ThermoPol Buffer, 2.5 mM of each dNTP, 1 .5 mM MgS04 , and 
lO^il formamide. Amplification was performed using the following cycling scheme: 

30 

1.5 cycles of: 95^C - 30 sec, 68°C - 20 sec, 75^C - 3 min. 

2. 5 cycles of: 95°C - 20 sec, 63^C - 20 sec, 75°C - 3 min. 

3. 35 cycles of: 95*'C - 20 sec, 55°C - 1 0 sec, 75°C - 3 min. 



(SEQ. ID. No. 196) 
(SEQ. ID. No. 197) 
(SEQ. ID. No. 198) 
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Product was visualized in a 1 .0% native agarose gel as a single band of 0.7 Kb- The 
fragment Was purified and partiedly sequenced. 

5 Multiple alignment of the gene downstream of D.rad. identified the 

following conservative region: 

Deinococcus radiodurans GFGGVQLHAAHGYLLSQFLSPRHNVREDEYGG 

(SEQ. ID. No. 203) 

1 0 Caenorhabditis elegans GFDGIQLHGAHGYLLSQFTS PTTNKRVDKYGG 

• (SEQ. ID. No. 204) 

Pseudomonas aeruginosa GFSGVEIHAAHGYLLSQFLSPLSNRRSDAWGG 

(SEQ. ID. No, 205) 
Archaeoglobus fulgidus GFDAVQLHAAHGYLLSEF I S PHVNRRKDEYGG 

15 (SEQ. ID. No. 206) 

The fragment in bold was used to design primers, specifically the 
downstream primer, for cloning of the 3' region of the T.th, holB gene. The upstream 
30mer (5'-CATCCTGGACTCGGCCCACCTCCTCACCGA-3') (SEQ. ID. No. 207) 

20 encodes the amino acid sequence ILDSAHLLT (SEQ. ID. No. 208). The downstream 
33mer (5'- GAGGAGGTAGCCGTGGGCCGCGTGGAGCTCCAG-S') (SEQ. ID. 
No. 209) encodes the sequence VELHAAHGYLL (SEQ. ID. No. 210) on the 
complementary strand. The amplification reactions contained 50ng TJh genomic 
DNA and 0. 1 \xM of each primer in a volume of 1 OOjil of Deep Vent polymerase 

25 reaction mixture containing 10 ^il ThermoPol BuflFer, 2.5 mM of each dNTP, 2 mM 
MgS04 , and 10 |il DMSO. Amplification was performed using the followiiig cycling 
scheme: 

1 . 5 cycles of: 95°C - 30 sec, 70^C - 20 sec, 75^C - 4 min. 

2. 5 cycles of: 95*^0 - 20 sec, 66°C - 20 sec, 75°C - 4 min. 
30 3. 30 cycles of: 95^C - 20 sec, 60°C - 10 sec, 77^C - 4 min. 

Products were visualized in a 1 .0% native agarose gel as a single band of 1 .1 kb. The 
Kb fi-agment was gel purified and sequenced to provide the remainder of the holB 
gene encoding T.th. 8'. 



For protein expression, the T th holB gene was cloned into the pET24 
vector at the Nde:EcoR sites using a pair of primers. The upstream 32mer (5 - 
GGCTTTCCCATMSGCTCTACACCCGGCTCAC.30 (SEQ. ID. No. 211) contains 
a Ndel site within the first 1 5 nucleotides (underlined) and the sequence 
corresponding to the 5' region of T th holB. The downstream 29 mer (5 - 
GCGTGGATCCACGGTCATGTCTCTAAGTC-3') (SEQ. ID. No. 212) contains a 
BamHI site within the first 1 0 nucleotides (underlined) and a sequence 
complementary to the 3* end of the gene. 

EXAMPLE 15 

Alternate synthetic path in absence of clamp loader activity 

As discussed earlier, the Pol Ill-type enzyme of the present invention is 
capable of application and use in a variety of contexts, including a method wherein 
the clamp loader component that is traditionally involved in the initiation of enzyme 
activity, is not required. The clamp loader generally fimctions to increase the 
efficiency of ring assembly onto circular primed DNA, because both the ring and the 
DNA are circles and one must be broken transiently for them to become interlocked 
rings. In such a reaction, the clamp loader increases the efiBciehcy of opening the 
ring. 

The procedure described below illustrates the instance where the clamp 
loader need not be present. For example, the B clamp can be assembled onto DNA in 
the absence of the clamp loader. Particularly, the bxilk of primed templates in PGR 
reactions are linear ssDNA Augments that are primed at the ends. On linear primed 
DNA, the ring need not open at all. Instead, the ring can simply thread onto the end 
of the linear primed template (Bauer and Burgers, 1988; Tan et al, 1986; O'Day et al., 
1992; Burgers and Yoder, 1993). Hence, on linear primed templates, such as those 
generated in PGR, the beta clamp can simply slide over the DNA end. After the ring 
slides onto the end, the DNA polymerase can associate with the ring for enhanced 
DNA synthesis. 

Such "end assembly" is common among Pol Ill-type enzymes and has 
been denionstrated in yeast and human systems. Rings assembling onto linear DNA 
for use by their respective DNA polymerases are shown in the following example 



demonstrated in the E. coli bacterial system, in the himian system, and in the TJh 
system. 

The bxilk of the primed templates in PGR reactions are linear ssDNA 
fiagments that are primed at their ends. However, these end primed linear fiagments 
are not generated imtil after the jSrst step of PGR has already been performed. In the 
very first step, PGR primers generally anneal at intemal sites in a heat denatured 
ssDNA template. Primed linear templates are then generated in subsequent steps 
enabling use of this alternate path. For this first step, the clamp may be assembled 
onto an intemal site in the absence of the clamp loader using special conditions that 
allow clamp assembly in the absence of a clamp loader. 

For example, a set of conditions that lead to assembly of the clamp 
onto circular DNA (i.e., intemal primed sites) have been described in the protocol for 
the use of the bacteriophage T4 ring shaped clamp (gene 45 protein) without the 
clamp loader (Reddy et al., 1 993). In this case, polyethylene glycol leads to 
"macromolecular crowding" such that the clamp and DNA are pushed together in 
close proximity, leading to the ring self assembling onto intemal primed sites on 
circular DNA. Other possible conditions that may lead to assembly of rings onto 
intemal sites include use of a high concentration of beta such that use of heat or 
denaturant to break the dimeric ring into two half rings (crescents) followed by 
lowering the heat (or dilution or removal of denaturant) leading to rings assembling 
around the DNA. 

The ring shaped sliding clamps of E. coli and human slide over the end 
of linear DNA to activate their respective DNA polymerase in the absence of the 
clamp loader. This clamp loader independent assay is performed in the bacterial 
system in Fig. 25 A. For this assay, the linear template is polydA primed with 
oligodT. The polydA is of average length 4500 nucleotides and was purchased from 
SuperTecs. 01igodT35 was synthesized by Oligos etc. The template was prepared 
using 145^1 of 5.2 mM (as nucleotide) polydA and 22 nl of 1.75 mM (as nucleotide) 
oligodT. The mixture was incubated in a final volume pf 2100 \i\ T.E. buffer (ratio as 
nucleotide was 21:1 polydA to oligodT). The mixture was heated to boiling in a 1 ml 
Eppendorf tube, then removed and allowed to cool to room temperature. Assays were 
performed in a finaj volume of 25 ^il 20 mM Tris-Cl (pH 7.5), 8 mM MgCh, 5 mM 
DTT, 0.5 mM EDTA, 40 mg/ml BSA, 4% glycerol, containing 20 \\M [a-^^P]dTTP, 
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0-1 ^ig polydA-oligodT, 25 ng Pol III and, where present, 5 |xg of B subiinit. Proteins 
were added to the reaction on ice, then shifted to 37^C for S min. DNA synthesis was 
quantitated using DE81 paper as described (Rowen and Kdmberg, 1978). 

In the linear template assay, no ATP or dATP is provided and 
5 therefore, a clamp loader, even if present, is not active. Thus, the clamp (e.g., fi) can 
only stimulate the DNA polymerase provided the clamp threads onto the DNA (see 
diagram in Fig. 25). Hence, threading of the clamp is shown by a stimulation of the 
DNA polymerase. In lane 1 of Fig. 25 A, the DNA polymerase is incubated with the 
the linear DNA in the absence of the clamp, and lane 2 shows the result of adding the 

10 clamp. The results show that the clamp is able to thread onto the DNA ends and 
stimulate the DNA polymerase in the absence of ATP and thus, in the absence of 
clamp loading as well. 

This clamp loader independent assay is performed in the human system 
in Fig. 2SB. The assay reaction (25 vil) contains 50 mM Tris-HCl (pH=7.8), 8 mM 

15 MgC12, 1 mM DTT, 1 mM creatine phosphate, 40 ^g/ml bovine serum albumin, 0.55 
\ig human SSB, 1 00 ng PCNA (where present), 7 units DNA polymerase delta (1 xmit 
incorporates 1 pmol dTMP in 60 min.), 40 mM [a-^^P]dTTP and 0.1 ^g 
polydA-oligodT. Proteins were added to the reaction on ice, then shifted to 37®C for 
60 min. DNA synthesis was quantitated using DE81 paper as described (Rowen and 

20 Romberg, 1978). In lane 3, (Fig. 25) the DNA polymerase 6 is incubated with the 
linear DNA in the absence of the clamp, and lane 4 showes the result of adding the 
PCNA clamp. The results demonstrate that the clamp is able to thread onto the DNA 
ends and stimulate the DNA polymerase in the absence of ATP and thus, the absence 
of clanip loading. 

25 This claxnp loader independent assay is performed in the r.//i. system 

in Fig. 25C. The assay reaction is exactly as described above for use of the E. coli Pol 
III and beta system except the temperature is 60°C and here the Pol III is HEP.Pl 
T.th Pol in (0.5 ^1, providing 0.1 units where one tmit is equal to 1 pmol of dTTP 
incorporated in 1 minute under these conditions and in the absence of beta), and the 

30 beta subimit is 7 ^g T.th. B (from the MonoQ colimin). Proteins were added to the 
reaction on ice, then shifted to 37^C for 60 min. DNA synthesis was quantitated 
using DE81 paper as described (Rowen and Romberg, 1978). In lane 3 (Fig. 25C), 
the TTh Pol III is incubated with the linear DNA in the absence of the clamp, and 
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lane 4 shows the result of adding the TJh B clamp. The results demonstrate that the 
clamp is able to thread onto the DNA ends and stimulate the DNA polymerase in the 
absence of clamp loader activity. 

5 EXAMPLE 16 



Use of TJh Pol III in lone chain primer extension 

A characteristic of Pol Ill-type enzymes is their ability to extend a 
single primer for several kilobases around a long (e.g. 7 kb) circular single stranded 

10 DNA genome of a bacteriophage. This reaction uses the circular B clanip protein. 

For the circular fi to be assembled onto a circular DNA genome^ the circular fi must be 
opened, positioned around the DNA, and then closed. This assembly of the circular 
beta around DNA requires the action of the clamp loader, which uses ATP to open 
and close the ring around DNA. In this example, the 7.2 kb circular single strand 

15 DNA genome of bacteriophage M l 3mpl 8 was used as a template. This template was 
primed with a single DNA 57mer oligonucleotide and the Pol III enzyme was tested 
for conversion of this template to a double strand circular form (RFII). The reaction 
was supplemented with recombinant TJh. B produced in E. coli. This assay is 
sunimarized in the scheme at the top of Fig. 26. M13mpl8 ssDNA was phenol 

20 extracted from phage purified as described (Turner and O'Donnell, 1995). M13mpl8 
ssDNA was primed vvith a 57mer DNA oligomer synthesized by Oligos etc. The 
replication assays contained 73 ng singly primed M13mpl 8 ssDNA and 100 ng TJh 
B subimit m a 25 ^l reaction containing 20 mM TrisrHCl (pH 7.5), 8 mM MgGh, 40 
jig/ml BSA, 0.1 mM EDTA, 4% glycerol, 0.5 mM ATP, 60 ^M each of dCTP, dGTP, 

25 dATP arid 20 ^M a-^^P-TTP (specific activity 2,000-4,000 cpm/pmol). Either TJh 
Pol III from the Heparin, peak 1 (HEP.Pl; 5 |il, 0.21 imits where 1 unit equals 1 pmol 
nucleotide incorporated in 1 min.) or a non-Pol III from the Heparin peak 2 (HEP.P2; 
5 |il, 2.6 units) were added to the reaction. Reactions were shifted to 60° C for 5 
min., and then DNA synthesis was quenched upon adding 25 |il of 1% SDS, 40 mM 

30 EDTA. One half of the reaction was analyzed in a 0.8% native agarose gel, and the 
other half was quantitated using DE81 paper as described (Stud well and O'Donnell, 
1990). 
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The results of the assay are shown in Fig. 26. Lane 1 is the result 
obtained using ih& T.tk Pol III (HEP.Pl) which was capable of extending the primer 
around the ssDNA circle to form RFIL Lane 2 shows the result of using the non-Pol 
ni (HEP.P2) which was not capable of this extension and produced only incomplete 

5 DNA products (the result shown included 0.8 jJig E. coli SSB which did not increase 
the chain length of the product). In the absence of SSB, the same product was 
Observed, although the band contained more counts. The greater amount of total 
synthesis observed in lane 2 is due to the build up of immature products in a small 
region of the gel. The presence of inmiature proiducts in lane 1 is likely due to a 

10 contaminating polymerase in tiie preparation that can not convert the single primer to 
the full length RFII form. Alternatively, the presence of incomplete products in lane 1 
(Pol III type enzyme) is due to secondary structure in the DNA which causes the Pol 
III to pause. In this case it may be presumed that performing the reaction at higher 
temperature could remove the secondary structure barrier. Altematively, SSB could 

15 be added to the assay (although TJh SSB would be needed, because addition ofE, 

coli SSB was tried and did not alter the quality of the product profile). Generally, SSB 
is needed to remove secpndsgry structure elements firom ssDNA at 37*^0 for complete 
extension of primers by mesophilic Pol Ill-type enzymes. 

The assay described above was performed at 60*^0. The TJh Pol III 

20 HEP.Pl gained activity as the temperature was increased from 37**C to 60**C, as 

expected for an enzyme from a thermophilic source. The E, coli Pol III lost activity at 
60°C compared to as expected for an enzyme from a mesophilic source. 

EXAMPLE 17 

25 

Materials used in Examples 1 8-29 

Radioactive nucleotide were from Dupont NEN; vmlabeled nucleotides 
were from Pharmacia Upjohn. DNA oligonucleotides were syntliesized by Gibco 
BRL. M13mpl 8 ssDNA was purified from phage that was isolated by two successive 
30 bandmgs in cesium chloride gradients. Ml 3mpl 8 ssDNA was primed with a 30-mer 
(map position 68 1 7-6846) as described. The pET protein expression vectors and 
BL21 (DE3) protein expression strain of E. coli were purchased from Novagen. DNA 
modification enzymes were from New England Biolabs. Aquifex aeolicus genomic 
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DNA was a gift of Dr. Robert Huber and Dr. Karl Stetter (Regensburg University, 
Germany). Protein concentrations were determined by absorbance at 280nm using 
extention coefficients calculated from their known Trp and Tyr content using the 
equation E28tf^Tipm (5690 M;^ cm"* )+ Tyrn (1280 M"* cm'*). 

5 * ' 

EXAMPLE 18 

Purification of a Encoded bv dnaE 

The Aquifex aeolicus dnaE gene was previously identified (Deckert et 

10 al., 1 998). The dnaE was obtained by searching the Aquifex aeolicus genome with 
the amino acid sequence ofr./Zz a subunit (encoded by <*ia£). The gene was 
amplified from Aquifex aeolicus genomic DNA by PGR using the following primers: 
the upstream 37mer (5'.GTGTGTCATATGAGTAAG 
GATTTCGTCCACCTTCAGC-3 ') (SEQ. ID. No. 1 57) contains an Ndel site 

15 (underlined); the downstream 34mer (5'- 

GtGTG TGGATCG GGGGACTACTCGGAAGTAAGGG-3 ') (SEQ. ID. No. 158) 
contains a BamHI site (underlined). The PGR product was digested with Ndel and 
BamHI, purifed, and lijgated into the pET24 Ndel and BamHI sites to produce 
pETAadnaE. 

20 The pETAadnaE plasmid was transformed into the BL21 (DE3) strain 

of jB. coli. Gells were grown in 50L of LB containing lOOjig/ml of kanamycin, SmM 
MgS04 at 3TC to OD600 = 2.0, induced with 2mM IPTG for 20h at 20^G, then 
collected by centrifiigation. Cells were resuspended in 400ml 50mM Tris-HCl (pH 
7.5), 10% sucrose, IM NaGl, 30mM spermidine, 5mM DTT and 2mM EDTA. The 

25 following procedures were performed at 4^G. Gells were lysed by passing them twice 
through a French Press (15,000 psi) followed by centrifiigation at 13,000 rpm for 90 
min at 4'*C. In this protein preparation, as well as each of those that follow, the 
induced Aquifex aeolicus protein was easily discemible as a large band in an SDS 
polyacrylamide gel stained with Coomassie Blue. Hence, column fractions were 

30 assayed for the presence of the Aquifex aeolicus protein by SDS PAGE analysis, 
which forms the basis for pooling colunm fractions. 

The clarified cell lysate was heated to 65®C for 30 min and the 
precipitate was removed by centrifiigation at 13,000 rpm in a GSA rotor for Ih. The 
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supernatant (1 .4gm, 280ml) was dialyzed against buffer A (20mM Tris-HCl (pH 
7.5)), 10% glycerol, 0.5 mM EDTA, 5mM DTT) overnight, then diluted to 320ml 
with buffer A to a conductivity equal to 1 OOmM NaCl. The dialysate was applied to a 
1 50ml Fast Flow Q (FFQ) Sepharose column (Pharmacia) equilibrated in buffer A, 
5 and eluted with a 1 .5L linear gradient of 0-500mM NaCl in buffer A. Eighty firactions 
were collected. Fractions38-58(lg, 390ml) were pooled, dialyzed versus buffer 
overnight, and applied to a 250mi Heparin Agarose column (Bio-Rad) equilibrated 
with buffer A. Protein was eluted witfi a IL linear O-SmM NaCl gradient in buffer A. 
One hundred fractions were collected, jpractions 69-79 (320 mg in 200 ml) were 
10 pooled and dialyzed against buffer A containing 1 00 mM NaCl. The a preparation 
was aliquoted and stored frozen at -80°C (see Fig. 27). 

EXAMPLE 19 

15 Purification of 8 Encoded bv holA 

The Aquifex aeolicus holA gene was not previously identified by the 
genome sequencing group at Diyersa (Deckert et al., 1998). Aquifex aeolicus holA 
was identified by searching the Aquifex aeolicus genome with the amino acid 
sequence of the T.th 5 subunit (encoded by holA). The Aquifex aeolicus holA was 

20 amplified by PCR using the following primers: the upstream 36mer (5'- 

GTGTG TCATATGG AAACCACAATATTCCAGTTCCAG-3') (SEQ. ID. No. 159) 
contains an Ndel site (underlined); the downstream 39mer (5'- 
GTGTG TGGATCCT TATCCACCATGAGAAGTATTnTCAC-3^) (SEQ. ID. No. 
160) contains a BamHI site (xmderlined). The PCR product was digested with Ndel 

25 and BaniHI, purified, and ligated into the pET24 Ndel and BamHI sites to produce 
pETAaholA. 

The pETAaholA plasmid was transformed into £. coli strain BL21 
(DE3). Cells were grown in 50L of LB media containing lOQH g/ml kanamycin. Cells 
were grown at 37°C to ODaoo = 2.0, induced for 20h upon addition of 2mM IPTG, 
30 then collecte;d by centrifugation. Cells fi-om 25L of culture were lysed as described in 
Example 18. 

The cell lysate was heated to 65 ^'C for 30 min and the precipatate was 
removed by centrifiigation. The supernatant (650mg, 240ml) was dialyzed against 
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bufiFer A, adjusted to a conductivity equal to 160mM NaCl by addition of 40ml of 
buffer A, and applied to a 220ml Heparin Agarose column equilibrated in buffer A 
containing 1 OOmM NaCl. The column was eluted with 1 .OL linear gradient of 1 50- 
700 mM NaCl in buffer A. One hundred and four fractions were collected. Fractions 
5 45-56 were pooled (250mg, 2 1 0 ml), diluted with 23 0ml buffer A to a conductivity 
equal to 230mM!NaCl, then loaded onto a 100ml FFQ Sepharose column equilbrated 
in buffer A containing 1 50mM NaCl. The column was eluted with 200ml linear 
gradient of 1 50-750mM NaCl in buffer A; seventy-three fractions were collected. 
Fractions 16-38 were pooled (95mg, 40ml), aliquoted, and stored at -SO^'C (see Fig. 
10 27), . 

EXAMPLE 20 

Purification of 8' Encoded bv holB 

15 l^e y4grMz)fex aeo//cw5 /i<?/5 gene was previously identified by the 

genome sequencing facility at Di versa (Deckert et al., 1998). The Aquifex aeolicus 
holB sequence was obtained by searching the Aquifex aeolicus genome with the 
sequence of the Tjh, 5' (encoded by holE). The Aquifex aeolicus holB gene was 
amplified by PCR using the following primers: the upstream 39mer (5'- 

20 GTGTGTCMIMSGAAAAAGTTTTTTTTGGAAA AAACTCCAG-3') (SEQ. ID. 
No. 161) contains an Ndel site (vmderlined); the downstream 35mer (5'- 
GTGTGTGGAICCTTAATCCGCCTGAACGGCTAACG-3') (SEQ. ID. No. 162) 
contains a BiamHI site (underlined). The PCR product was digested with Ndel and 
BamHI, purified, and ligated into the pET24 Ndel and BamHI site to produce 

25 pETAaholB. 

The pETAaholB plasmid was transforaied into E, coli strain BL21 
(DE3). Cells were grown at 37*'C in 50L media containing lOO^g/ml kanamycin to 
ODeoo 2.0, then induced for 3h upon addition of 0.2mM IPTG. Cells were collected 
by centrifugation and were lysed using lysozyme by the heat lysis procedure (Wickner 
30 and Komberg, 1974). The cell lystate was heated to 65°C for 30 min and precipatate 
was removed by centrifugation. The supernatant (2.4g, 400ml) was dialyzed versus 
buffer A, then applied to a 220ml FFQ Sepharose column equilibrated in buffer A. 
Protein was eluted with a IL linear gradient of 0-500mM NaCl in buffer A; eighty 
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firactions were collected. Fractions 23-30 were pooled and diluted 2-fold with buffer 
A to a conductivity equal to lOOmM NaCl, then loaded onto a 200ml Heparin 
Agarose column equilibrated in buffer A. Protein was eluted with a IL linear gradient 
of 0-1 .OM NaCl in buflferA; eighty-foxir fractions were collected. Fractions 46-66 
5 were pooled (1 .3g, 395ml), dialyzed versus buffer A containing 1 OOmM NaCl, then 
aliquoted and stored frozen at -SO'^C (see Fig. 27) 

EXAMPLE 21 

10 Purification of x Encoded bv dnaX 

The Aquifex aeolicus dnaX gene was previously identified (Deckert et 
al., 1998). The dnaX gene sequence was obtained by searching the Aquifex aeolicus 
genome with the sequence of T.th. x subuhit (encoded by dnaX). The Aquifex 
aeolicus dnaX was amplified by PCR using the following primers: the upstream 

15 41mer f5'-GTGTG TCATATGA ACTACGTTCCCTTCGCGAGAAAGTACAG-3') 
(SEQ. lb. No. 163) contains an Ndel site (underlined); the downstream 36mer (5'- 
GTGTGTGGATCCTTAAAACAGCCTCGTGCCGCTGGA-3') (SEQ. ID. No. 164) 
contains a BamHI site (underlined). The PCR product was digested with Ndel and 
BamHI, pxirified, and ligated into the pET24 Ndel and BamHI sites to produce 

20 pETAadnaX. 

The pETAadnaX plasmid was transformed into E. colt strain BL21 
(DE3). Cells were grown in SOL LB containing 100 ng/ml kanamycin at 37°C to 
OD600 = 0.6, then induced for 20h at 20^C upon addition of IPTG to 0.2mM. Cells 
were collected by centrifiigation and lysed as described in Example 18: The clarified 

25 cell lysate was heated to 65°C for 30 min and the protein precipitate was removed by 
centrifiigation. The supernatant (l.lg in 340ml) was treated vsdth 0.228g/ml 
ammonium sulfate followed by centrifiigation. The x subunit remained in the pellet 
which was dissolved in buffer B (20mM Hepes (pH 7.5), O.SmM EDTA, 2mM DTT, 
10% glycerol) and dialyzed versus buffer B to a conductivity equal to 87mM NaCl. 

30 The dialysate (1073mg, 570ml) was applied to a 200ml FFQ Sepharose column 
equilibrated in buffer A. The column was eluted with a 1 .5L linear gradient of 0- 
500mM NaCl in buffer A; eighty fi*actions were collected. Fractions 28-37 were 
pooled (289mg, 138ml), dialyzed against buffer A to a conductivity equal to 82mM 
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NaCl, then loaded onto a 1 50ml column of Heparin Agarose equilibrated in buffer A. 
The column was eluted with a 900ml linear gradient of 0-500mM NaCl in buffer A; 
thirty-two fractions were collected. Fractions 15-18 (1 87mg, 1 1 0ml) were dialyzed 
versus buffer A, then aliquoted and stored at -80°C (see Fig- 27). 

5 

EXAMPLE 22 

Purification of B Encoded bv dnoN 

The Aquifex aeolicus dnoN gene was previously identified (Deckert et 

10 al., 1998). The dnoN sequence was obtained by searching the Aquifex aeolicus 
genome with the sequence of T.th. p subunit (encoded by dnaN). The Aquifex 
aeolicus dnaN g^ne was amplified by PGR using the following primers: the upstream 
33mer f5^-GTGTG TCATATG CGCGTTAAGGTGGACAGGGAG-3 ') (SEQ. ID. 
No. 165) contains an Ndel site (underlined); the downstream 36mer (5'- 

15 TGTG TCTCGAG T CATGGCTACACCCTCATCGGCAT-3 ') (SEQ. ID. No. 166) 
contains a Xhol site (underlined). The PGR product was digested with Ndel and 
BamHI, pxirified, and ligated into the pET24 Ndel and BamHI sites to produce 
pETAadnaN. 

The pETAadnaN plasixiid was transformed into E. coli strain BL21 
20 (DE3). Cells were grown in IL LB containing lOOmg/ml kanamycin at ST^'C to 

OD600 = 1 -0, then induced for 6h upon addition of 2mM IPTG. Cells were collected 
(7g) and lysed as described in Example 1 8. The cell lysate was heated to 65°C for 30 
min and the protein precipitate was removed by centrifiigation. The supernatant 
(39mg, 45ml) was applied to a 10ml DEAE Sephacel column (Pharmacia) 
25 equilibrated in buffer A. The column was eluted with a 1 00ml linear gradient of 0- 

500mM NaCl in bufferA; seventy-five fractions were collected. Fractions 45-57 were 
pooled (1 8.7mg), dialyzed versus buffer A, and applied to a 30ml Heparin Agarose 
column equilibrated in buffer A. The column was eluted with a 300ml linear gradient 
of 0-500niM NaCl in buffer A; sixty-five fi-actions were collected. Fractions 27-33 
30 were pooled (1 Img, 28ml) and stored at -80°C (see Fig. 27). 
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EXAMPLE23 

Purification of SSB Encoded bv ssb 

The Aquifex aeolicus ssb gene was previously identified (Deckert et 
5 al., 1998g). The ssb gene sequence was obtained by searching the Aquifex aeolicus 
genome with the sequence of T.th SSB (encoded by ssb). The Aquifex aeolicus ssb 
gene was amplified by PGR using the following primers: the upstream 47mer (5'- 
GTGTG TCATATGC TCAA TAAGGTTTTTATAATAGGAAGACTTACQGG-3') 
(SEQ. ID. No. 167) contains an Ndel site (imderlined); the downstream 39mer (5'- 
10 GTG TGGATCCT TA AAAAGGTATTTCGTCCTCTTCATCGG-3') (SEQ. ID. No! 
168) contains a BamHI site (imderlined). The PGR product was digested with Ndel 
and BamiHI, purified, and ligated into the pET16 Ndel and BamHI sites to produce 
pETAassb. 

The pETAassb plasmid was transformed into E. coli istrain BL21 
15 (DE3). Cells were grown in 6L of LB media containing 200^g/mlampicillin. Cells 
were grown at 37^C to OD6oo= 0.6, then induced at 15°C overnight in the presence of 
2mM IPTG and collected by centrifugation. Cells were lysed as described above in 
Example 18, except cells were resuspended in buffer C (20mM Tris-HCl (pH 7.9), 
SOOmMNaGl). 

20 The cell lysate was heated to 65**G for 30 min, then the precipitate was 

removed by centrifugation. The supernatant (1.4g, 190ml) was applied to 2Sml 
Chelating Sepharose column (Pharmacia-Biotech) charged with 50mM Nickel Sulfate 
and then equilibrated in buffer C containing SmM Imidazole. The colimm was eluted 
with a 300ml linear gradient of 5-lOOmM Imidazole in buffer G. Fractions of 4ml 

25 were collected. Fractions 8 1 -92 were pooled (-240mg in 48ml) and dialyzed 

overnight against 2L of buffer B containing 20pmM NaCl. The dialysate was diluted 
to a conductivity equal to 92mM NaCl using buffer A and then loaded onto an 8ml 
MonoQ column equilibrated in buffer A containing lOOmM NaCl. The colimm was 
eluted with a 120ml linear gradient of 1 00-500mM Imidazole in buffer A. Seventy- 

30 four fractions were collected. Fractions 57-70 were pooled (lOOmg, 25ml), aliquoted, 
and stored at -80^C (see Fig- 27). 
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EXAMPLE 24 

MonoO Preparation of t55' 

The 8 subunit (0.29mg) purified in Example 19 and 5' subunit 
5 (0.3 Img) purified in Example 20 were mixed in a volume of 2.8ml of buffer A at 
15®C. After 30min, the x subunit (O.Smg in 1.4ml), purified in Example 21, was 
added and the reaction was incubated a fiirther Ih at 15**C. The reaction was applied 
to a 1ml MonoQ colimm equilibrated in buffer A. The t65* complex elutes later than 
either x, 8 or 8' alone. Protein was eluted with a 32ml linear gradient of 100-500mM 

10 NaCl in buffer A; eighty fractions were collected. Analysis of the MonoQ fi-actions in 
a SDS polyacylamide gel shows a peak of x88' complex that elutes in fi^ctions of 
32-38 (see Fig. 28). The peak fi^ctioiis 850^g were stored at -80**C, This procedure 
can easily be scaled up. For example, a much larger amount of xS8' was constituted 
by following a similar protocol and using a 8ml MonoQ colimin, which yielded 9.6mg 

15 ofxSS'. 

EXAMPLE 25 

Constitution of ax88' Complex 

20 The reaction mixture contained 1 .2 mg asubunit (9nmol; 133,207 da) 

purified in Example 1 8, 0.41mg x subimit (7.5 nmol; 54,332 da) purified in 
Example 21, 0.41 mg 8 subxmit (10 nmol; 40,693 da) purified in Example 19, and 0.2 
mg 8' subxmit (9nmol; 29,000 da) purified in Example 20 in 1 .1ml buffer A. The a 
and X subxmit solutions were premixed in 871 j^l for 2h at 1 5°C before adding 8 and 8' 

25 subunit isolution, then the complete mixtxire was allowed to incubate an additional 
12 h at 15 ^C. The reaction may not reqxiire an order of addition, or these extended 
incubation times. The reaction mixture was concentrated to 200^x1 using a Centricon 
30 at 4^C, then applied to an FPLC Superose 6 HR 1 0/30 column (25ml) at 4°C 
developed with a continuous flow of buffer A containing lOOmM NaCl. After the 

30 first 216 drops (6.6ml), fi-actions of 7 drops each were collected. Fractions were 

analyzed on a SDS polyacrylamide gel stained with Coomassie Blue (Fig. 29). The 
analysis was repeated using the a subunit alone (Fig. 29). The results show that the 
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peak fractions of a shift to a considerably earlier position when x, 8 and 8' are present 
and a comigrates with t, S, and 8', when compared to the elution position of a alone, 
indicating that a assembles with x, 8 and 8' into a ax88' complex. 

5 EXAMPLE 26 

ax8S' Functions with the B Clamp 

Replication reactions were performed using circular Ml 3mpl 8 ssDNA 
primed with a synthetic DNA 90 mer oligonucleotide. Reactions contained 8.6|ig 

10 primed M13mpl8 ssDNA, 9.4^g S SB purified in Example 23, 1.0|ag axS8' prepared 
in Example 25, and 2.0p.g p subunit purified in Example 22 (when present), in 230|il 
of 20mM Tris-HCl (pH 7.5), 5mM DTT, 4% glycerol, 8mM MgCk, 0.5mM ATP, 
60^M each dATP and dGTP (buffer composition is for a final volume of 250|xl). 
Reactions were mixed on ice, then aliquoted into separate tubes containing 25^1 each. 

15 For each timed reaction, the mixture was brought to 65®C for 2 min before initiating 
syntheses upon addition of 2^1 of dCTP and a^^P-dTTP (final centrations, 60 and 
40nM, respectively). Aliquots were quenched at the times indicated in Fig. 30 upon 
adding 4|il of 0.25M EDTA, 1% SDS. Quenched reactions were then analyzed in a 
0.8% alkaline agarose gel. The results, illustrated in Fig. 30^ demonstrate that 

20 eflScient synthesis requires addition of the p subimit. Comparison with size standards 
in the same gel indicates an average speed of ~12S nucleotides; the leading edge of 
ihe product smear indicates a maximum speed of 375 nucleotides/s. 



25 



EXAMPLE 27 



Purification of Tth a subunit 

To obtain TJh, a subimit, 8 L of coli BL21(DE3) cells harboring 
pETtthalpha were grown to O.D. = 0.3 and induced upon adding IPTG. Cells were 
collected by centrifugation and resuspended in 200 ml 50mM Tris-HCl (pH 7.5), 10% 
30 sucrose, IM NaCl, 30mM spermidine, 5mM DTT and 2mM EDTA. The following 
procedures were performed at 4**C. Cells were lysed by passing them three times 
through a French Press (20,000 psi) followed by incubation at 4°C for 30 min and. 
then centrifugation at 1 8,000 rpm in an SS-34 rotor for 45 min at 4*^C. Induced 
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protein was less that 1% total cell protien but was discernible as a band that migrated 
in the appropriate position for its predicted molecular weight in an SDS 
polyacrylamide gel stained with Coomassie Blue. Hence, column fractions were 
assayed for the presence of the protein by SDS PAGE analysis, which forms the basis 
5 for pooling colunin fractions. 

iThe clarified cell lysate was heated to 65X for 30 min and the 
precipitate was removed by centrifugation. The supernatant (1 .4gm, 280ml) was 
dialyzed gainst buffer A (20mM Tris-HCl (pH 7.5), 1 0% glycerol, 0.5 mM EDTA, 
5mM DTT) overnight, then diluted to 320ml with buffer A to a conductivity equal to 

10 1 OOmM NaCl. The dialysate (approximately 1 50 mg) was applied to a 60ml DEAE 
Fast Flow Q (FFQ) Sepharose column (Pharmacia) equilibrated in buffer A, and 
eluted with a 600 ml linear gradient of Or500mM NaCl in buffer A. Fractions of 8 ml 
each were collected. The Tth. a subunit could be seen as a major band in several 
fractions, especially in fractions 26-30. In these peak fractions the Tth a subunit was 

15 approximately 20-30 percent pure. 

EXAMPLE 28 

Purification of Tth. e subunit 

20 The dnaQ gene was cloned into the pET16 expression plasmid using 

the Val withm the context "VGLWEW..." and transformed into E. coli (BL21(DE3). 
This pET plasmid places an N-terminal leader containing six histidines onto the 
expressed protein to facilitate purification via use of chelate afSnity chromatography. 
Twelve liters of cells were grown to an OD of 0.7 and induced with IPTG. Induced 

25 cells were collected by centrifiigation and resuspended in 150 ml of buffer G (20mM 
Tris-HCl (pH 7.9), 500mM NaCl). Cells were lysed by passing them two times 
through a French Press (20,000 psi) followed by incubation at 4*^C for 30 min and 
then centriftigation at 13,800 rpm in an SLA-1 500 rotor for 45 min at 4°C. Induced 
protein appeared greater than 5% total cell protien and was easily discernible as a 

30 band that migrated in the appropriate position for its predicted molecular weight in an 
SDS polyacrylamide gel stained with Coomassie Blue. Hence, colunm fractions were 
assayed for the presence of the protein by SDS PAGE analysis, which forms the basis 
for pooling column fractions. 
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Upon analyzing the precipitate from the cell lysis, and the supematent, 
it was determined that the epsilon subunit was insoluble and appeared in the 
precipitate. Therefore the cell pellet was resuspended in 100 ml of binding buffer 
containing 6M freshly deionized urea. This resuspension was then placed in 

5 centrifiige bottles and spun at 13,800 rpm for 45 min in the SLA-1500 rotor. The 
epsilon was in the supematent and was applied to a 25 ml Chelating Sepharose 
colmnn (Pharmacia-Biotech) charged with 50 mM Nickel Sulfate and then 
equilibrated in buffer C containing SmM Imidazole. The colunm was washed with 
two column volumes of buffer C, then washed with 5 column volumes of beffer C 

1 0 containing 80 mM Imidazole (final). Then the Tth epsilon was eluted with a 250 ml 
linear gradient of 60*1 000 mM Imidazole in buffer C. Fractions of 4ml were 
collected. Fractions 1 5-24 were pooled (--131 mg) and dialyzed overnight against 2L 
of buffer A containing 6M urea, but no NaCl or glycerol. The dialy^te was then 
loaded onto an 8ml MonoQ colmnn equilibrated in buffer A containing 6M urea. The 

1 5 column was eluted with a 120ml linear gradient of 0-500 mM NaCl in buffer A 

containing urea. Sixty five fi:actions were collected. The epsilon is approximately 
80-90 percent pxire at this stage. Fractions 13^17 were stored at -80°C. The epsilon is 
in xirea but is at a concentration of 5-1 0 mg/ml, and thus can be used with other 
proteins by diluting it such that the final urea concentration is less than 0.5 M. This 

20 level of urea does not generally denature protein, and should allow epsilon to renature 
for catalytic activity, 

EXAMPLE 29 

25 Temperature optimum of Aquifex and Thermits a subunit DNA polvmerases 

The temperature optimum of the alpha subvmits of the Aquifex and 
TTiermuj replicases was tested in the calf thymus DNA replication assay. In this 
experiment, the reactions were assembled on ice in 25 |il containing 2.5 |ig calf 
thymus activated DNA, and either 0.88 ug Aquifex a, or 0.6 |ag of the Thermus a, 

30 DEAE pool of peak fi*actions (obtained fi-om Examples 1 8 and 28, respectively) in 20 
mM Tris-HCl (pH 8.8), 8 mM MgCl2, 1 0 mM KCl, 1 0 mM (NH4)S04, 2 mM 
MgS04, 0.1% Triton X-100, 60 ^M each dATP, dCTP, dGTP, and 20 ^M a32p- 
dTTP. Reactons were shifted to either 30, 40, 50, 60, 70, 80, or 90°C, then stopped 
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after 5 minutes and spotted onto DE8 1 filters to qumititate DNA synthesis. The 
results, illustrated in Figs. 3 1 -32, show that these enzymes increase in activity as the 
temperature is raised. The Thermus a has a broad peak of activity from 70-80OC 
(Fig. 31), wWle the Aquifex a is maximal at 80*C (Fig. 32). The Aquifex a retains 
5 considerable activity at 90°C, whereas the Thermus a is nearly inactive at 90*'C, a 
result tfiat is consistent with the Wgher temperature at wWch the aeo/icus niay 

live relative to the TTier/wMS bacterium. 

EXAMPLE 30 

10 

Temperature optimum of Aauifex aT55'/g 

Aquifex a, P, t88', SSB and axSS' were tested for stability at different 
temperatures by incubating the protein in a solution, followed by performing a 
replication assay of the protein. Incubation was performed in 0.4 ml tubes under 
15 mmeral oil. The 5 jil reaction mixture contained: buffer B (20 mM trisrliCl (pH 
7.5), 5 mM DTT, 5 mM EDTA), and either: 0.352 ng of a (Fig. 33 A), 0.2 jig of p 
(Fig. 33B), 0.125 ^g x complex (Fig. 33C), 0.32 \ig SSB and 0.042 \ig primed 
M13mpl8 ssDNA (Fig. 33D), 0.82 \ig Pol III* (Fig. 33E). Reactions were incubated 
for 2 min. at either 70, 80, 85 , or 90°C in the presence of either 0.1% Triton X- 1 00 
20 (filled diamonds); 0.05% Tween-20 and 0.01% NP-40 (filled circles); 4 mM CaCb 

(filled triangles); 40% Glycerol (inverted filled triangles); 0.01% Triton X-100, 0.05% 
Tween-20, 0.01% NP-40, 4 mM CaCb (half-filled square); 40% Glycerol, 0.1% 
Triton X-100 (open diamonds); 40% Glycerol, 0.05% Tween-20, 0.01% NP-40 (open 
circles); 40% Glycerol, 4 mM CaCb (open triangles); 40% Glycerol, 0.01% Triton X- 
25 1 00, 0.05% Tween-20, 0.0 1 % NP-40, 4 mM CaCh (half-filled diamonds). After 

heating, reactions were shifted to ice and 20 ^1 of replication assay buffer was added 
followed by incubation for 1 .5 min at 70°C; 1 5 was then spotted onto a DE81 filter 
and DNA synthesis was quantitated. The replication assay buffer contained: 60 mM 
Tris-HCl (pH 9. 1 at 25°C), 8mM MgCb, 1 8 mM (NH4)2S04, 2 mM ATP, 60 ^M each 
30 of dATP, dCTP, dGTP, and 20 jaM [a'^^P] TTP (specific activity 1 0,000 cpm/pmol), 
and 0.264 ng primed Ml 3mpl 8 ssDNA. To assay for p, 0. 1 ng aT58' was added to 
the reaction. To assay x85', 0.9 ng P and 0.17 ng a were added to the reaction. To 
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assay for SSB, 0.17 ng E, coli P and 0.1 ng E. coli axSB' were added to the reaction 
followed by incubation for 1 .5 min at 37**C. To assay for CLxhb\ 0.9 ng P was added 
to the reaction. To assay a, the calf thymxis DNA replication assay was performed in 
the buffer as described above but 2.S |j.g activated calf thymus DNA was used instead 
5 of primed Ml 3mpl 8 ssDNA, no other replication proteins were added, and incubation 
was for 8 min at TO^'C. 
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This invention may be embodied in other forms or carried out in other ways without 
departing from the spirit or essential characteristics thereof. The present disclosure is 
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